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Equivalent Circuits of Linear Active Four-Terminal Networks’ 
By 
LISS C. PETERSON 


INTRODUCTION 


HE art of equivalent network representation has grown very con 

siderably since its inception by Dr. G. A. Campbell. In his paper 
“Cissoidal Oscillations” which was published in 1911 he proved that any 
passive network made up of a tinite number of invariable elements and hay 
ing one pair of input terminals and one pair of output terminals is externally 
equivalent to an unsymmetrical T or I] network. From this modest be- 
ginning the field of applications of the equivalent circuit concept has steadily 
expanded so that by now the whole field of linear passive circuit theory has 
been subjected to equivalent circuit interpretation. 

With the advent of the thermionic vacuum tube amplitler, linear active 
network theory had to be considered and almost immediately the attempt 
was made to obtain an equivalent circuit whose performance would depict 
the linear characteristic of the tube. The equivalent circuit art has also, 
in recent years, been used to describe the performance of certain classes of 
ach. non-linear devices, such as mixers, and further applications in this field will 
no doubt be made. 

Equivalent circuit concepts have played an important part in electrical 

* This paper appears substantially as it was originally prepared some vears ago as a 
technical memorandum for internal distribution within Bell’ ‘Telephone Laboratories 
Its publication is rendered timely by the applic ability to the recently announced transis 
tor devices. Present experience indicates, in fact, that the configuration of Fig. 13 
furnishes the most useful equivalent four-pole network for the transistor 

Mr. J. A. Morton has called my attention to an early paper in this tield by Strecker and 
Feldtkeller (E.N.T. Vol. 6, page 93, 1929) in which the general theory of active networks 


has been well developed. However, the early state of the then prevailing art prevented the 
full demonstration of the power of the method and of course precluded the possibility of 





application to modern devices. This paper enlarges the general theory and makes logica 
applications of the method to modern devices Since the appearance the Str 


ol ecKeT 
“ele eller paper several other papers tou¢ it g upon this subject have appearec ow 
Feldtkell | oth I I j I | i. 


ever, no attempt at giving a complete bibliography e made, except to call attentior 

to the contributions of Prof. M. J. O. Strutt, who also has adopted the four-pole point of 
view 

The I.R.E. Electron Tube Committee has adopted the four pole viewpoint ar d has 

proposed methods of tests for experimentally determining the four-pole parameters of ele 

tron tubes. This material will be published in the new I.R.E. standards on electror 

5. A. tubes, 
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engineering, particularly in communication engineering. One might alrmos 
say that a problem is not solved unless an equivalent circuit has been { 
whose performance will exhibit some of the characteristics of the a 
problem. This need for circuit concepts reflects a desire to make 
phenomena more alive and subject to physical interpretation, for it is 
that equivalent circuit concepts have greatly contributed towards physical 
interpretation of analytical expressions. A problem may very wel 
studied without recourse to the equivalent circuit concept and a cor 
answer obtained, yet the development of an equivalent circuit adds greatly 
to the complete interpretation of the physical phenomena. 
In this paper we shall be concerned only with linear a-c amplifier operat 


where the term linear indicates that the analytical expressions connecting 
currents and voltages are linear, that is, involve only the first power of 
instantaneous current or its derivative. We shall further restrict our atte: 
tion to the usual mode of four-terminal amplifier operation in which one pai! 
of terminals is associated with the signal to be amplified and the other pair 
with the amplified signal. The equivalent a-c circuit of such a transducer 
will require the development and interpretation of the linear relations 
connecting the a-c currents and voltages at the input terminals with corre 
sponding quantities at the output terminals. At this point we can logica 
postulate that the important formal difference between an active and 
passive four-pole lies in that the law of reciprocity no longer can be assume: 
to hold for the active network. Since passive four-poles require three inde 
pendent parameters for their complete specification the active four-pol 
will require at least one additional parameter. 

In the practical applications we shall be principally concerned with t 
various triode four-pole connections. A short review of the various stages 
involved in deriving the newer forms of equivalent triode circuits will ther: 
fore be considered prior to the introduction of generalized concepts. Su 
a.review is in the main concerned with the effect of frequency upon the 
early forms of the equivalent triode circuit. 

In the review we shall confine ourselves to the usual grounded cathod 
mode of operation since it is only in recent times that grounded grid and 
grounded plate operation have come into use. This distinction is, howeve: 
not necessary and is introduced solely for simplicity 

The conventional notation as well as the positive current directions ar 
indicated on Fig. 1 for a general four-pole N. It is assumed here that termi 
nals 1 are the available input and terminals 2 the available output terminals 
This choice of current direction appears to the writer to be the most con 
venient to use when the four-pole is erergized at the input terminals 1 only 

Consider, now, a grounded triode operated at such a low frequency that 
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displacement or capac ity currents can be disregarded, and let it first be 
posed that the grid is negatively polarized with respect to the ¢ athode so 
grid current is absent. This represents the most primitive form oi 
eration, governed by a law which has been termed ‘The equivalent-plate- 
ircuit Theorem.” With reference to the assumed current direction this 
eorem may be expressed by saying that the application of the voltage 1°; to 
he grid is equivalent, so far as phenomena in the plate circuit are concerned, 
to the application of the voltage —w1l; in series with a resistance r, , where 


u is the amplification factor and r, the internal plate resistance. The 


® @ 
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Fig. 1—Current-voltage relations for a general four-pole 
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Fig. 2—Equivalent circuit of a negative grid triode at low frequencies 


equivalent circuit for this mode of operation is thus as shown in Fig. 2, where 
the input terminals 1 are represented by the grid G and the cathode C and 
the output terminals by the anode P and the cathode C. In terms of analy- 
sis the circuit is described by the two equations 


(1) 


Equations (1) and their associated circuit Fig. 2 are expressions of the 
fact that in any closed loop the voltages must be in equilibrium. 
By a slight rearrangement of (1) a network representation based on current 


' Chaffee, “Theory of Thermionic Vacuum Tubes,” page 192 
’ ) g 
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equilibrium may be obtained. For this purpose (1) is written as 


/ () 


Phe corresponding network representation is as shown in Fig. 3, where 


energizing source in the plate circuit consists of a constant current gener 


: M , , sad 
of strength I’; impressed across the output terminals 2. 
. 
G ® 
sit | 
l, GS Tp lo 
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ta negative grid triode at low frequencies 
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Fig. 4—Equivalent circuit of a positive grid triode at low frequencies 


Let us now. consider a further step and assume that the grid is positive s 
that grid current is also flowing. In regard to the grid circuit there is 
theorem called the **Equivalent-grid-circuit Theorem” which is exactly 
similar to the corresponding plate circuit theorem. The theorem says that 
the a-c grid current can be calculated by assuming that an e.m.f. yu, | 
acts in series with a resistance 7, where uw, is the reflex factor and r, the 
internal grid resistance. In symbols and by using the notation of Fig. 1 


this is expressed by: 


By combining the two theorems the equivalent circuit of Fig. 4 results 
Again by writing (3) as 


a corresponding network based upon current equilibrium may be found. 
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It is well to note that this general low-frequency case is governed by the 


two simultaneous equations 


1 uu 
l, | j 
} } 
5) 
Le 1 
Is | | 
, } 
which are of the general form 
/, y] l | 1 T ) J 
0 
/ Do; j d | 
where 
1 
By 3 
$ ; 
t 1 
O 
} } 


The network of Fig. 4 is thus a possible form of circuit interpretation of 
(5) or (6). It may also be observed that (6) represents at least formally 
the most general formulation of the linear active four-pole so that even at 
very low frequencies the general four-pole point of view might be useful 

Several observations may now be made. In the first place it should be 
noted that these networks are not based on any study of the internal action 
of the tube, but rather on the purely formal mathematical process of dif- 
ferentiating the two functional relations which express the broad fact that 
plate and grid currents are some unspecified linear continuous functions of 
the grid and plate potentials in the neighborhood of the operating point. 

In the second place it may be observed that the network of Fig. 4 repre 
sents in a sense two separate networks interacting with each other by means 
of voltage or current generators. This method of equivalent circuit repre- 
sentation is the result of separate interpretation of the equivalent plate and 
grid circuit theorems. Asa corollary it follows that such a four-pole equiva 
lence involves at least two generators within the network in order to take the 
effect of interaction into account. 

We may say that the networks discussed were satisfactory so long as the 
frequency was low enough to allow displacement currents to be disregarded 
With the operation of circuits at higher frequencies (up to the order of 10° eps, 
say) it became necessary to take the internal tube capacitances into account. 
This was done by the superposition of a capacity network as shown in Fig. 
5. It is interesting as well as instructive to formulate this network transi- 
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tion in analytical terms. ‘The transition rests upon the physical fact that the 
total current entering or leaving an electrode is the sum of conduction and 
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Fig. 5—Equivalent circuit of a positive grid triode at moderately low frequencies. 
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Fig. 6—Parasitic triode network 


displacement current. The network for the displacement currents is passive 
and is shown on Fig. 6. For this network: 


hh = BV + BiV2 
: ’ . (8) 
I, = -Bi2Vi + BooV 2 


, ' . , . _— - e 
where J; is input and /2 the output displacement current. The coefficients 
appearing have the values 


Bir = iw(Coc + Cop) 
Bie = —iwCop > (9) 
Boo = —iw(Cpe + Cop) 


The potentials appearing in (8) are the same as those which occur in the 
conduction current equations (5), this being the physical condition which 
also must be satisfied. By adding (5) and (8) and by letting 7; and /, 
mean total currents we get: 


qT; = | + 1w(¢ gc + C »)| | i. E aa lw ¢ > | 2 { 
"9 "g 


> (10) 

. 1 ; > i. 

Ie _ | - + iC | J  Ricoas E + iw(C pe + Cw) Ve | 
Pp 
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These equations are again of the form 


l, == Bi Vy + Bio V4 


; 11) 
Ir = Bn Vi + Bo Ve 
where now 
1 
31) + tw(¢ t Cop) 
r, 
342 t tw, | 
| 12) 
in = — © + ial, 
t; 
l or . 
39 = — “Ft tnGCs. 7 ¢ ov) 
ft; 


The network interpretation of Fig. 5 is consistent with (10) and so does, 
in fact, constitute a possible network representation, 
From (12) the relation 


bn + Be = — E . te 13) 


is obtained. The thing to emphasize is that this sum is independent of the 
passive feedback admittance and that it is a constant independent of fre- 
quency within the range considered. It is, moreover, a quantity which 
only is correlated with the conduction currents in the tube. For lack of a 
better name the sum (13) will be referred to as “the effective transconduct 
ance.”’ It will play an important role in the later discussion. 

As still higher frequencies (above 10’ eps) were employed it necame neces 
sary to take into account lead effects, usually in the form of self and mutual 
inductances, and by incorporating them in the network of Fig. 5 a slightly 
more involved circuit was obtained. This more general network is still 
determined by equations of the form (11), for adding the new network ele- 
ments merely means that linear transformations are applied to the potentials 
V, and WV» with the result that a new set of 6-coeTicients is obtained, the 
new set being merely of somewhat more complicated form than the first. 

With the utilization of frequencies so high that the electron transit times 
became comparable with the period of the applied signal, further complica- 
tions arose and the equivalent network idea was put to a severe test. In 
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this development we may distinguish between two methods of approach 
In one the attempt was made to modify the low-frequency network of Fig 
5 to include transit time effects to a first order of approximation.? Th 
second approach differed in that attention was directed only toward thi 
electron stream itself, while the circuit elements connecting the stream wit! 
the physically available terminals were grouped together with the externa 
circuit elements.’ The latter approach represented a particularly usefu 
one from a physical point of view and it also extended the use of basic circuit 
elements to include the general diode impedance as a new circuit element 
complete in itself. However, even in this latest approach the four-pol 
point of view was not adopted, with consequent loss of generality and unity 
in viewpoint. Moreover, the fact that only the electron stream itself was 
considered caused some confusion. 

With this brief review of the development of equivalent circuit representa 
tion of vacuum tube amplifiers in mind we turn now to the main body of the 
paper in which a more general treatment of the problem is considered. It 
will be shown, in the coming sections, how it is possible to lump all the 
factors involved in vacuum tube amplifiers, i.e., physical circuit parameter 
and internal electronic effects involving the electron transit time, into a 
single coordinated picture with an equivalent circuit representation of the 
overall effect. 


EQUIVALENT CIRCUIT REPRESENTATION OF ACTIVE LINEAR FouR-POLt! 
PoLE EQUATIONS 


Whenever the response of a general transducer is related in a linear manner 
to the stimulus, the transducer behavior is described by two linear relations. 
Although we are primarily concerned with electromagnetic transducers the 
concepts to be used are of broader utility and may, for example, also be 
applied to mechanical and electromechanical transducers. 

There are various ways in which the behavior of the four-pole may be 
expressed analytically. The form expressing current equilibrium has al 
ready been given and it may well serve as a starting point for the following 
discussion: 


Thus we have: 


l, : Biv, T Bi2Vo 
, , (14) 
I; BouVi + BoVs 


> F. B. Llewellyn, “Electron-Inertia Effects,’ Cambridge University Press, 1941. 

F. B. Llewellyn and L. C. Peterson, Interpretation of Ultra-High Frequency Tube 
Performance in Terms of Equivalent Networks, Proceedings of the National Electronics 
Conference, 1944 
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EQUIVALENT CIRCUITS Ol VETWORAS 1} 
s expressions for current equilibrium at any frequency Phe four param 
ters 8 which appear have simple physical meanings, and it is seen that 
31, is the input admittance with output shorted 


—B» is the output admittance with input shorted 


-By is the feedback admittance with input shorted 
3; is the transfer admittance with output shorted 

Before proceeding to the network representations of (14) it seems well 
to state brietly some of the reasons which almost force one to adopt the four- 


pole point of view when dealing with vacuum tubes in the higher frequency 


range. 


There are the pedagogical reasons that classical methods long employe: 
for passive networks are merely extended into the realm of active networks, 
thus providing unity in viewpoints. 

The basic analysis involving the four-pole parameters for a particular 
transducer needs to be performed only once and, once obtained, all problems 
involving terminal impedances may, in any particular case, be solved in a 
routine manner. 

There are also further practical reasons. We saw above that with in 
crease in frequency the classical equivalent network had to be moditied to a 
considerable extent in order to include the parasitic elements. This poses 
a serious problem for the tube designer, whose task it is to estimate the tube 
performance between known terminations. Such a task based upon the 
modified classical circuit becomes very difficult and cumbersome. More 
over, it is also difficult to segregate and measure the parasitic elements. 
Hence it appears that one could gain much if design parameters could be 
developed capable of retlecting parasitic and transit time effects. Finally, 
it is desirable to develop equivalent circuits with a minimum number of 
parameters bearing simple relationships to quantities which can be measured 
directly. 

These general desires arising from the practical needs of the tube designer 
can be satisfied if tube behavior is specitied by means of four-pole parameters. 

All in all the four-pole point of view can be made to satisfy the logical 
needs of integrated concepts as well as the practical needs of simplicity in 
the specification of tube performance. 

After this brief presentation of the argument for the four-pole point of 
view, the network representation of (14) will now be considered. 

Stated in broadest terms: we are seeking a network representation by 
considering the two equations as a single unit and not by the trivial considera 
tion of each equation by itself. 
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We need, to begin with, the well-known network representation of a 
passive four-pole. Equation (14) has, then, the form 


I, BiiV, T ByVe 


(15 
rf BiVi + BoVe} 


and one equivalent circuit representation is that given by the IT network 
having the element values shown in Fig. 7. If so desired the II network 
can of course also be transferred into an equivalent T network. 


“Bia 


. ; | 
| i 


l |AueBr2 -(Bo2 -f312) 


saith e | 


Fig. 7—Equivalent circuit of a passive four-pole 
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Fig. 8—Equivalent circuit of an active four-pole; current impressed at the output. 


Now write (14) as 


I, = BuVi + BrV2 
(16) 
Io = —ByV1 + BeVo + (B12 + Ba)Vi1 
Whence it is seen by a comparison with (15) that the network representation 
of the active four-pole differs from the passive one merely by the presence 
of the impressed current (81. - 621)V1. A possible network representation 
of the general active four-pole is thus as shown on Fig. 8. 

An immediate application may be illuminating. Consider, for example, 
the triode operated with positive grid, with interelectrode capacitances 
taken into account. The four-pole equations are given by (11) and (12) 
and the classical network is that of Fig. 5. From (12) and Fig. 8 we get the 
network of Fig. 9. It may be observed that, while in Fig. 5 the source and 
source-free constituents are intermingled, this is not the case in Fig. 9 
where, on the contrary, a clear demarcation is present between such con- 
stituents. 
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In regard to the general network of Fig. 8 it may be noted that the 
.etwork is composed of two parts. One part obeys the reciprocal law and is 
represented by a II (or T) network and is consequently specified by three 
parameters. The other part is merely an impressed current controlled by 
the input potential V;. From the fact that the II (or T) network obeys the 
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Fig. 9—Equivalent circuit of a positive grid triode at moderately low frequencies: 
] £ | 
current impressed at the output. 
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Fig. 10—Equivalent circuit of an active four-pole; current impressed at the input 
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Fig. 11—Equivalent circuit of a positive grid triode at moderately low frequencies 
current impressed at the input. 


reciprocal law, the conclusion does not follow that it in general behaves as a 
passive network. The element values may, for example, be negative, and 
Bode’s integral relations may not necessarily be true for this network. 

It should further be noted that the network representation holds for all 
frequencies. The effect of frequency will be that the admittances in the 
network change and in changing they will, for example, reflect effects due to 
parasitics and electron transit time. 
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We next observe that the network of Fig. 8 is not unique; in other wor 
it is not the only possible network. ‘To see this it need merely be observe 
that from the group of four independent 3 parameters in (15) there ar 
several ways in which a subgroup may be selected containing only three o 
them. For example the “passive part” of the network in Fig. 8 was cor 
structed from the three parameters 811, B12 and Bo». 
the only choice. Moreover, the impressed force was taken to be voltage 
controlled but this again does not represent the only possibility. 

In general it follows that the “passive part” of the network will reflect 
at least three properties of the complete network. In Fig. 8, for example 
the “passive part’’ reproduces faithfully the two short-circuit driving-point 
admittances and the feedback admittance of the complete network. 


I‘inally it is well to observe that only one driving force is needed in the 


general network formulation. 


With this background of the general ideas involved let us now further 


explore some of the possibilities suggested as to other ‘passive network”’ 


constituents. Let us, for example, use 31; , 822. and 32; for the “passive part”. 
We then write (14) as 


= Buily — BaiVeo + (B12 + Ba) Ve 


(77) 


and from (17) the network representation of Fig. 10 follows. In addition 
this network differs from that of Fig. 8, in that the impressed current appears 
on the input side and that it is controlled by the output rather than by the 
input voltage. As an illustration consider again the triode operated with 
positive grid. The equivalent network is now as shown in Fig. 11. 

Now let it be supposed that the impressed force be current rather than 
voltage-controlled. We then first transform (14) into 


/ Bi ,, 
Vy, =— —- Ve 
B11 O11 
(18) 
bo ; 321 Bio 
n= 1,4 V2 (se _" *) 
O11 By 
and then rewrite (18) as 
Bre 
V; * h =" 
Bit Bry 
(19) 
Dio B21 By2 Biz + Be 
Is / T | 2 (s: — ) + I 
iy Bi Bir 


But thisis evidently not 
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be shown that the network representation of Fig. 12 is consistent with 


This network representation suffers from two obvious disadvantages. 

One is that the “passive network” is determined by only one short-circuit 
ving-point admittance of the complete network. The other short-circuit 
ving-point admittance of the “passive part” appears to be unrelated to 

vy simple admittance which may be found from measurements at the 
itput terminals of the complete network. The second disadvantage is 
it the coefficient of the current in the impressed force is of a complicated 
iture, since a driving-point admittance enters. Moreover, a closer 
nvestigation shows that in this network representation the “passive part” 


besides preserving the driving-point admittance 3,, and the feedback ad 





mittance By, also preserves the quantity A 3113 3 j232,;0f the complete 
vetwork. 
Biz Piz * Pas ,, 
ly — By to 
7 | a. 
[ ] Bi2tB +f34, 
V5 By +B> B22-By2 = lv. 
v t j \ Bi ’ 





| | 


Fig. 12—Equivalent circuit of an active four-pole; current impressed at the output 


By proceeding from (19) in a way similar to that used in (17) the network 
in Fig. 12 transforms into another in which the impressed force appears on 
the input side. Many other networks can also be found. 

The networks discussed were based on (14), which expressed the fact of 


current equilibrium and leads rather naturally to IT networks together with 


an impressed current source. On the other hand, starting with the four 
pole equations which express voltage equilibrium, one encounters T networks 
together with impressed electromotive forces. These networks must now 
also be considered. In regard to the details involved in their derivation we 
may be very brief since the methods are similar to those already employed 

The four-pole equations expressing voltage equilibrium may be written 


aS 


Vv; Zyl, + Zyle 
20 


V Zul, + Zool 


where current and voltage directions are assumed to be taken in accordance 


with the conventions of Fig. 1 
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rhe four parameters Z are of simple physical significance and it follows 
that: 
Z\; is the input impedance with output open 
Z~ is the output impedance with input open 
Zo is the feedback impedance with input open 
Z», is the transfer impedance with output open. 


The relations between the 8’s and the Z’s are given by the expressions: 


— 2 
Pi Pe 
Z2 
B2 = — 
Az 
(21 
Z 
by = — 
Az 
Zu 
Boe = = 
Az 
where 
Zn Z12 
Az = (22 
Zn ZL 
We have also 
l 
Az = (23 
Bs 
where 
Bi Pie 
A; = (24 
Bo Bae 


Applying now to (20) the transformations which led to the networks of 
Figs. 9, 10 and 12, we get the networks of Figs. 13, 14 and 15. 

The “passive part’’ of the network in Fig. 13 reproduces the two ope! 
circuit impedances Z,; and Z2 as well as the feedback impedance Zy of the 
complete network. 

The network in Fig. 14 differs from that of Fig. 13 because of the use of th: 
transfer impedance Z2; in the “passive part” with the result that the im 
pressed current appears on the input side. 
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(he “passive part” in Fig. 15, finally, preserves the open-circuit imped 
e Z,, the feedback impedance Z,. and the determinant A; of the com 
ete network. This network shows in identally a close resemblance to one 


ilready published.” 
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Fig. 14—Equivalent circuit of an active four-pole; voltage impressed in series with the 
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Fig. 15—Equivalent circuit of an active four-pole; voltage impressed in series with the 
output. 


It is well to emphasize that, while all the complete networks are equiva 
lent, this is not true for the ‘‘passive parts.” In fact from their derivation 
it follows that none are equivalent. Specific circumstances may make it 
desirable to perform transformations on the passive parts. For example, 
it might be more convenient to work with a I than with a T network. Such 
transformations are of course perfectly legitimate and raise the question of 
choice of equivalent networks. A few remarks on this subject may be 


* Loc. cit. 
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appropriate, since the choice is not quite a matter of indifference. Broa 
speaking, the choice will depend upon the relative advantages of nodal a 
mesh analysis and in most practical situations the former has prov 
to be the more convenient of the two. One cannot, however, be too d 
matic in this regard. Consider the networks of Figs. 8 and 13. Suppo 
for example, that parasitic elements appearing as a passive IT network | 
to be superimposed, It is then more conventient to use Fig. 8, not only 
account of the ease with which this may be done, but also on account 
the fact that the effective transadmittance 8) + Bo; is invariant with resp 
to such a superposition. If, on the other hand, parasitic elements app 
as series elements (lead inductances for example) the network of Fig. 13 
might be more convenient since the effective transimpedance Zp» + Z 
now remains invariant. 

It is also desirable to choose an equivalent network whose elements ar 
capable of being determined by simple measurements; from this consider 
tion the network on Fig. 8 is of distinct advantage. 


APPLICATION TO TRIODES 


The preceding section was primarily directed towards the development 
of possible forms of network representations of the general four-pole equ: 
tions. In this section one of these forms, namely that given in Fig. 8, wi 
be used to represent the three modes of triode operation. Depending upor 
which electrode is at a-c ground potential, we may distinguish between th« 


following methods of operation: 
1. Grounded cathode operation. 


2. Grounded grid operation. 


3. Grounded plate operation. 


The schematic diagrams, together with assumed voltage and current 
directions for these modes, are shown on Figs. 16, 17 and 18 respectively 

With a given set of available terminals the first step in obtaining the 
networks consists in calculating the four-pole parameters with respect. to 
these terminals. It will be assumed that the coupling circuits have beet 
designed with such efficiency that lead effects can be disregarded, so that the 
available terminals actually coincide with anode, grid and cathode. This 
set of available terminals brings us as close to the electron stream as it is 
physically possible to attain and it represents the ideal towards which desig: 
tends. 

It is beyond the scope of this paper to consider the details involved in the 
calculations of the four-pole parameters. The basic tools needed are th 


result.of a study of the dynamics of the electron stream, which started from 
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roa undamentals,’ and some familiarity with this work is assumed on the 
lal a of the reader. Concerning these tools two reservations need be made It 
Pro\ the first place the tools apply to planar rather than to cylindrical structures 
oO dl Since, however, there is a decided tendency toward planar structures 
Ippo especially in the high-frequency tield, because of a desire for uniform electrot 
rk | streams, this limitation does not seem serious. In the second place the tools 
nly are also subject to the limitation of a single-valued velocity electron stre 
unt 
‘eSpe : I> 
Appeal ———— 
‘ig. 13 G ; 
1. 7 = res v2 
{v: | | 
its are C ‘ . oC 
sider: Fig. 16—Vurrent-voltage relations for the grounded cathod 
$e 
pment ’ G 
equi 
8. WI 
,Upol I, 
on the Fig. 17—Current-voltage relations for the grounded grid 
F c 
fi oe ee | 
, ' 4 
iy \ j | V2 
ee 
irrent 
ively Fig. 18—Current-voltage relations for the grounded plate 
4 the 
ict to Thi, again, is not too serious, since one of the aims in present high-freque: 
bee: tube design is to produce as uniform a stream as possible. Nevertheless, 
it the the effects produced by multiple velocities are important to know. Studies 
This along such lines have been made by Mr. Frank Gray of these Laboratories 
HE Is The operating conditions of the triode are assumed to be quite general. 
esigi There are, for example, no restrictions placed upon frequency and space 
charge and the grid may, moreover, have either positive or negative 
n the potential with respect to the cathode. 
e the , . 
7 1F. B. Llewellyn and L. C. Peterson, “Vacuum Tube Networks,” Proceedijgs , 
Irom I.R.E., March, 1944. 














610 BELL SYSTEM TECHNICAL JOURNAL 


With current and voltage directions as in Figs. 16, 17 and 18, the follow- 
ing Tables I, II and III list the four-pole parameters for the three modes of 
triode operation in both 8 and Z forms. 


PABLE I 


Four-PoLE PARAMETERS FOR GROUNDED CATHODE TRIODE 
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w=amplication factor 


The y admittance coefficients appearing in the above tables were fully 
explained and discussed in the paper on Vacuum Tube Networks, to which 
reference has already been made. Suffice it here to say that yy is the 
admittance of the diode coinciding with cathode and equivalent grid plane 
and yo the admittance of the diode coinciding with the equivalent grid plane 
and the anode and finally ya; the transadmittance between these fictitious 
diodes. The admittance y;; depends upon the d-c conditions between cath- 
ode and grid and upon the transit angle for this region alone. The diode 
admittance ye depends in a similar manner upon the d-c space charge 
conditions in the grid-anode region as well as upon the transit angle for this 
region alone. For the small degree of space charge which usually exists 
between grid and plate of most triodes, yx can be represented by a simple 
capacitance. The transadmittance yo; can be resolved into two factors, the 
first of which depends only upon the transit angle between cathode and 
grid, and the second only upon the transit angle between grid and anode. 
In the paper on vacuum tube networks all these admittances were plotted 








W- 


; of 





Four-Po.t 


uw = amplification factor 


Four-Po.t 


Vit 
Bu = 
Mi 
Ba = 
D 
Bi 
As = 
Ba 
D 1 
1 
| 
1 
Zu = 
Y¥22 
Zu 
Az ed 
Zu 


uw = amplication factor 


TABLE II 


PARAMETERS FOR GROUNDED GRID TRI 
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PARAMETERS FOR GROUNDED PLATE TRIODE 
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graphically, showing both phase and magnitude, and this paper is referred 
to for details." 

lables I, I] and LI, in conjunction with Figs. 8, 13 and 15, allow us to 
derive the equivalent networks of bigs. 19, 20 and 21. 

We must now undertake a discussion of the results given in the tables as 


well as of the networks which were derived from them. 
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Fig. 19 (a, b, ¢ Three forms of equivalent circuits of the grounded cathode triode 
valid at all frequencies 


Initially, it is well to emphasize again that the four-pole parameters and 
the corresponding networks are different but equivalent ways through which 
the triode signal behavior becomes completely specified for all conditions of 
space charge and for all frequen ies. 

Secondly, there are certain general relations which should be noted. We 
observe, first, that the determinants A; and Az are invariants for the dif- 
ferent modes of triode operation, and with exception of phase reversals this 
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the driving force per unit input voltage, is invariant (except for reversal 1 
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also the case for the effective transadmittance 8y + 32; as well as for the 
ffective transimpedance Zy» + Z, . On the other hand, the quantity 
Aa i Za 


Zi 


, Which appears in the network of Fig. 15 and which represents 


il 


hase) only for grounded grid and grounded cathode operation. Several 
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Fig. 20 (a, b, c)—Three forms of equivalent circuits of the grounded grid triode valid 


at all frequencies. 


admittance and impedance relations may also be pointed out. For example, 
the input short-circuit driving-point admittance 3); is equal for grounded 
cathode and grounded plate operation and the same is true for the output 
short-circuit driving-point admittances 8, for grounded cathode and 
grounded grid operation. Moreover, it is also seen that the input short- 
circuit driviny-point admittance 8,, for grounded grid operation is equal to 
the output driving-point admittance 82 for grounded plate operation. A 
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similar set of recipsocal relations between the open-circuit driving-poir 
impedances is also present. 

In regard to the networks it may first be observed that, since they wer 
derived from parameters upon which no restrictions had been placed o 
either frequency or space charge, they are also generally valid. In passiv: 
circuit theory one is accustomed to the use of only the three basic elements 
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Fig. 21 (a, b, c)—Three forms of equivalent circuits of the grounded plate triode valid 
at all frequencies. 


of resistance, inductance and capacitance. For the networks now under 
consideration other quantities also need to be included. However, as 
normally operated there is usually complete space-charge in the cathode-grid 
region and a very small amount of space charge in the grid-plate region. 
Under such circumstances the admittance ys: is a simple capacitance and the 
amplification factor uw is a real number. The admittances y,; and ye , on 
the other hand, do not allow accepted circuit interpretation to be made, 
except in the range of moderately low frequencies when electron transit 
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me is taken into account only to a first order of approximation. It is 
elieved that, in general, these admittances should be considered complete 
by themselves as new admittance elements, and, as already remarked, their 
values in magnitude and phase may be found in the paper on vacuum tube 
networks to which repeated reference has been made. 

As a further property of the networks, consider the expressions for the 
3’s in Tables I, Il and III. It is observed that each 8;; of a set of 8’s con- 
tains a common term, suggesting that the networks might be broken up into 
at least two elementary constituents. The same observation applies to 
each of the three sets of Z’s. In Table I, for example, it is seen that this 
constant term is represented by yx/D. The network of Fig. 19a can thus 
be thought of as arising from the superposition of two networks, one of which 
is determined by the parameters 


’ Yu + Yor , 
B =* - 3 9 = 0 | 
ll D 12 
(25) 
t ye1 , Yu 
pu = * fa = =, | 
oS oe uD 
and the other by the parameters 
” Yoo ” Y22 | 
8 = — bie = —* 
ae 3 D . 
{ 26 ) 
” Voo ” Voo 
bun = — fae = = 
7, oe _ D 


The admittance coefficients given by (25) correspond to the perfectly uni- 
lateral active network shown in Fig. 22a, while the admittance coefficients 
in (26) correspond to the “passive” network in Fig. 22b. The two element- 
ary constituents thus take the general forms of these networks. It should 
be noticed that these two network constituents are unrelated to the fact 
that the total current entering the complete network is the sum of conduction 
and displacement current. 

Corresponding to the Z’s other elementary constituents are obtained, with 
general forms as shown on Figs. 23a and 23b. 

These elementary constituents are merely retlections of certain mathe- 
matical identities. For example, the networks in Fig. 22 depend upon the 
matrix identity 


Buby Bu + Br 0 | — BB 


+ (27) 
|| B21 B22 || Ba +.B Boe — Bie} — PBB 


il 
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while those in Fig. 23 depend upon a corresponding identity. The identi 
(27) expresses the fact that the general ‘‘3-network” can be considered 
arising from the parallel connections of the two networks in Fig. 24. 
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Fig. 22 (a, b)——Elementary constituents of Fig. 19a 
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Fig. 23 (a, b) Elementary constituents of Fig. 19b. 
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Fig. 24 (a, b)—Elementary constituents of Fig. 8 


There is at present a tendency towards grid designs of very fine mesh. 
Such a grid design results in a very large value of the amplification factor 
and, for many purposes, sufficient accuracy may be obtained by disregarding 
terms containing — as a factor. Under these conditions the network for 

bb 
grounded cathode operation reduces to an L-network, that for grounded 
grid operation to a unilateral network transmitting in the direction from grid 
to plate only, while the cathode follower network remains essentially un- 
changed. 
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In the foregoing, the behavior of an active four-pole has been descri}ed 


either in terms of four-pole parameters or in terms of elements of an equiva 
lent circuit. The particular four-pole parameters, which are of customan 
use in communication engineering, are the so-called image parameters, 
they have usually been used only in connection with passive four-p 
They may, however, also be used in the more general vacuum tube four-| 
now under discussion, but whether their employment would be of pract 
TABLE VI 


Four-PoLE PARAMETERS FOR GROUNDED PLATE TRIODE IN THE RANGE OF 
MOopeERATELY Low FREQUENCIES 


~s 
3 2 ; ii 3 v) 1 
pis TT pa = 7 « ‘ 
1 


value is a matter which engineering experience will decide. In any case 
their use would be limited to vacuum tubes in which appreciable interaction 
between input and output terminals is present. From a practical standpoint 
this means that their usefulness would be mainly found in connection with 
triodes. 


PRIODE NETWORKS AT MODERATELY LOW FREQUENCIES 


The networks discussed in the preceding section were of general validity 
in respect to both operating conditions and frequency and it was mentioned 
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t the efforts of trying to interpret the “passive” parts of the networks 
the form of lumped passive circuit elements had, in general, met with not 
» much success. In this section attention will be given to the range of 
oderately low frequencies where usual circuit interpretation is possible 
[he operating conditions are assumed to be the usual ones with complete 
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space charge in the cathode-grid region, and negligible space charge in the 
grid-plate region. Also it will be assumed that the grid is at negative d-c 
potential with respect to the cathode. 

The first step is to expand the 8 coefficients in series with transit angles 
retained only to first or possibly to second orders. As the detailed computa- 
tions are lengthy only the final result will be given. These are presented in 


Tables IV, V and VI. 
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Fig. 26-—-Equivalent circuit of grounded grid triode at moderately high frequencies 


In these tables the symbols have the following meanings: 


gy = static conductance of the diode formed by the cathode and equivalent 
grid-plane. 

ux = low frequency amplification factor. 

x; = cathode-grid distance in cm. 

x. = grid-anode distance in cm. 


(, = cold capacitance between cathode and equivalent grid plane 

(2 = cold capacitance between anode and equivalent grid plane. 

6, = electron transit angle between cathode and equivalent grid plane 
It is most simply calculated from 


9 whi 


So 


i, = 


0, = electron transit angle between anode and equivalent grid plane. 





FQUIVALENT CIRCUITS OF NETWORKS 621 


C + 
(B12+Ao2; | 


ae ote SF 
Ly 
i 


deci: 

















wf 
S Mu 1 } i 
< : + - )| 
i a ,] ! 
, I 
( ( 
1 | ) 
1 + 1 + 
lb 3 
1 
1 
( ue 
= F = 
5 | 1 { 
1+ 1+ - 
Me ’ | 
} f 
( = ( 
3 1 4 
14 1+--f 
ey 4 
1 
, l 
lencies ee ub 


livalent 
| 
I 
0 ( 1 1 | 
( = ' 1 + 
10 ul 1 $ 4 | 3 l } | 
1 + 1 + : f | 1 j + f 
Me 3 3 4 ‘ 
10 6 
7 ot ae 
plane ert ate 
15 1 $4 
1 + 1 + : f 
u 34 
Fig. 27—Equivalent circuit of grounded plate triode at moderately hig! 1! 
plane. 


Sib 











@22 BELL SYSTEM TECHNICAL JOURNAL 


It may be calculated from 


w2Xe 
6. = 
V2n(V Vor + V Vie 
where 7 10 1./6 X 10% 
mM 
Vp, = equivalent d-c grid potential 
Vine = anode d-c potential 
220. YVn +t2VVpo , 5/0 WVp, + 3V Vp 
/ 1 + Bean ty ae 
“"VVn + VVm 3 \G1 VVun + V Vo 
L602 V Vm + 2V Vw 


2 4, N V v1 T \ V pe 


V Vn + 2V Vw 
VVn + V Vo 

With the aid of these tables and Fig. 8 the equivalent circuits on Figs. 25, 
26 and 27 are obtained. The networks are all of the resistance-capacity 
type. It may be noted that, in some of the branches, negative conductance 
or negative capacitance appears. However, as seen from the external tube 
terminals they are swamped by corresponding positive elements. 

rhe viewpoints presented in this paper have been used by the writer over 
a number of years. They have been given experimental application by 
Mr. J. A. Morton, who is principally responsible for their introduction and 
use in the studies in these Laboratories of electron tubes in the microwave 
regions. 

With this, our investigation comes to a close. Much has been omitted, 
particularly in the field of applications, but it is nevertheless hoped the funda- 
mental approach, as well as the networks given, may prove to be useful in 
practical applications. The questions of noise and of optimum noise figure 
design have also been left out of consideration. Mr. J. A. Morton and the 
writer plan to discuss these problems in a forthcoming paper. 

The writer is pleased to acknowledge his indebtedness to Messrs. R. K. 
Potter, J. A. Morton, and R. M. Ryder, who have encouraged this work 
and urged its publication; and to Mr. W. E. Kirkpatrick for constructively 
critical scrutiny of the original technical memorandum. 
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A Mathematical Theory of Communication 


By C. E. SHANNON 


Concluded from July 1948 issue 


PART III: MATHEMATICAL PRELIMINARIES 


In this final installment of the paper we consider the case where the 
signals or the messages or both are continuously variable, in contrast with 
the discrete nature assumed until now. To a considerable extent the con 
tinuous case can be obtained through a limiting process from the discrete 
case by dividing the continuum of messages and signals into a large but finite 
number of small regions and calculating the various parameters involved on 
a discrete basis. As the size of the regions is decreased these parameters in 
general approach as limits the proper values for the continuous case. There 
are, however, a few new effects that appear and also a general change of 
emphasis in the direction of specialization of the general results to particu 
lar cases. 

We will not attempt, in the continuous case, to obtain our results with 
the greatest generality, or with the extreme rigor of pure mathematics, since 
this would involve a great deal of abstract measure theory and would ob 
scure the main thread of the analysis. A preliminary study, however, indi 
cates that the theory can be formulated in a completely axiomatic and 
rigorous manner which includes both the continuous and discrete cases and 
many others. The occasional liberties taken with limiting processes in th 
present analysis can be justified in all cases of practical interest. 


18. Sets AND ENSEMBLES OF FUNCTIONS 


We shall have to deal in the continuous case with sets of functions and 
ensembles of functions. A set of functions, as the name implies, is merely a 
class or collection of functions, generally of one variable, time. It can be 
specified by giving an explicit representation of the various functions inthe 
set, or implicitly by giving a property which functions in the set possess and 
others do not. Some examples are: 

1. The set of functions: 


fo(t) = sin (¢ + 8). 


Each particular value of @ determines a particular function in the set. 


623 
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The set of all functions of time containing no frequencies over W' cyck 
per sec ond, 
3. The set of all functions limited in band to W and in amplitude to 1. 
+. The set of all English speech signals as functions of time. 
An ensemble of functions is a set of functions together with a probabilit 
measure whereby we may determine the probability of a function in th 


set having certain properties. For example with the set, 


sin ({ + @), 


we may give a probability distribution for 6, P(@). The set then becomes 
an ensemble. 
Some further examples of ensembles of functions are: 
1. A tinite set of functions fi(/) (Rk 1, 2, +++, ) with the probability of 
f, being fp 


\ finite dimensional family of functions 


with a probability distribution for the parameters a 
Play y Pew es 


lor example we could consider the ensemble detined by 


Gyo? ty Oey * °* gO ES = ad, sin nlwl + @,,) 


1 


with the amplitudes a, distributed normally and independently, and the 
phrases 6, distributed uniformly (from 0 to 27) and independently. 
3. The ensemble 


* sin r(2Wt — 2) 
Nas , uy >a, 


2 w2Wi— n) 


with the a, normal and independent all with the same standard deviation 
Vv NV. This isa representation of “white” noise, band-limited to the band 
from 0 to W cycles per second and with average power V. 


In mathematical terminology the functions belong to a measure space whose total 
measure is unity 

Phis representation can be used as a definition of band limited white noise. It has 
certain advantages in that it involves fewer limiting operations than do definitions that 
have been used in the past. The name ‘white noise,’ already firmly intrenched in the 
literature, is perhaps somewhat unfortunate. In optics white light means either any 
continuous spectrum as contrasted with a point spectrum, or a spectrum which is flat with 
vavelength (which is not the same as a spectrum flat with frequency). 
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Let points be distributed on the ¢ axis according to a Poisson distribu- 
tion. At each selected point the function /(/) is placed and the different 
functions added, giving the ensemble 


2 


Zz f+ &) 

k 2 
where the & are the points of the Poisson distribution. This ensemble 
can be considered as a type of impulse or shot noise where all the impulses 
are identical. 

5. The set of English speech functions with the probability measure given 

by the frequency of occurrence in ordinary use. 
An ensemble of functions fa(/) is stationary if the same ensemble results 


when all functions are shifted any fixed amount in time. The ensemble 
fe(t) = sin (¢t + 6) 


is stationary if @ distributed uniformly from 0 to 27. If we shift each func- 
tion by 4 we obtain 


fo(t + t;) = sin (tf + 4 + 8) 
= sin (tf + o¢) 


with ¢ distributed uniformly from 0 to 27. Each function has changed 
but the ensemble as a whole is invariant under the translation. The other 
examples given above are also stationary. 

An ensemble is ergodic if it is stationary, and there is no subset of the func 
tions in the set with a probability different from 0 and 1 which is stationary 
The ensemble 


sin (¢ + @) 


isergodic. No subset of these functions of probability #0, 1 is transformed 
into itself under all time translations. On the other hand the ensemble 


asin (¢ + @) 


with @ distributed normally and @ uniform is stationary but not ergodic 
The subset of these functions with a between 0 and 1 for example is 
stationary. 

Of the examples given, 3 and 4 are ergodic, and 5 may perhaps be con- 
sidered so. If an ensemble is ergodic we may say roughly that each func- 
tion in the set is typical of the ensemble. More precisely it is known that 
with an ergodic ensemble an average of any statistic over the ensemble is 
equal (with probability 1) to an average over all the time translations of a 
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particular function in the set.” Roughly speaking, each function can be 
pected, as time progresses, to go through, with the proper frequency, all 
convolutions of any of the functions in the set. 

Just as we may perform various operations on numbers or functions jo 
obtain new numbers or functions, we can perform operations on ensemb 
to obtain new ensembles. Suppose, for example, we have an ensemble 
functions f,(¢) and an operator T which gives for each function fa(t) a res 


Lall): 
La {) = Tfa(t) 


Probability measure is detined for the set ga(/) by means of that for the se 
fa(t). The probability of a certain subset of the go(/) functions is equa 
to that of the subset of the fa(/) functions which produce members of th 
given subset of g functions under the operation 7. Physically this corr 
sponds to passing the ensemble through some device, for example, a filter, 
a rectifier or a modulator. The output functions of the device form the 
ensemble ga(/). 

A device or operator 7 will be called invariant if shifting the input merel) 
shifts the output, ie., if 


galt) = Tfa(t) 
implies 
Lall T fh) wi That T fy) 


for all fa(d) and all 4. It is easily shown (see appendix 1) that if 7 is in 
variant and the input ensemble is stationary then the output ensemble is 
stationary. Likewise if the input is ergodic the output will also be ergodic 

A filter or a rectifier is invariant under all time translations. The opera 
tion of modulation is not since the carrier phase gives a certain time stru 
ture. However, modulation is invariant under all translations which are 
multiples of the period of the carrier. 

Wiener has pointed out the intimate relation between the invariance ot 
physical devices under time translations and Fourier theory.” He has 


This is the famous ergolic theorem or rather one aspect of this theorem which was 
proved is somewhat different formulations by Birkhoff, von Neumann, and Koopman, an 
subsequently generalized by Wiener, Hopf, Hurewicz and others. ‘The literature on ergodi 
theory is quite extensive and the reader is referred to the papers of these writers for pré 
cise and general formulations; e.g., E. Hopf * Ergodentheorie’’ Ergebnisse der Mathemati: 
und ihrer Grenzgebiete, Vol. 5, “On Causality Statistics and Probability’? Journal o! 
Mathematics and Physics, Vol. NIII, No. 1, 1934; N. Weiner “The Ergodic Theorem” 
Duke Mathematical Journal, Vol. 5, 1939. 

‘ Communication theory is heavily indebted to Wiener for much of its basic philosophy 
and theory. His classic NDRC report ‘The Interpolation, Extrapolation, and Smoothing 
of Stationary Time Series,’’ to appear soon in book form, contains the first clear-cut 
formulation of communication theory as a statistical problem, the study of operations 
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ywn, in fact, that if a device is linear as well as invariant Fourier analysis 
; then the appropriate mathematical tool for dealing with the problem 

An ensemble of functions is the appropriate mathematical representation 

the messages produced by a continuous source (for example speech), of 
the signals produced by a transmitter, and of the perturbing noise. Com 
munication theory is properly concerned, as has been emphasized by Wiener 
not with operations on particular functions, but with operations on en 
sembles of functions. A communication system is designed not for a pat 
ticular speech function and still less for a sine wave, but for the ensemble of 


speech functions. 


19. BAND Limirep ENSEMB ES OF FUNCTIONS 
If a function of time f(/) is limited to the band from 0 to HW cycles per 
second it is completely determined by giving its ordinates at a series of dis 


1 a ; 
crete points spaced yy Se onds apart in the manner indicated by the follow 


ing result.” 
Theorem 13: Let f(t) contain no frequencies over W. 


Then 
< sin r(2Wt — n) 
(t)= > X, 
L r(2Wt — n) 
where 
y re 
sai 2W 


In this expansion /(/) is represented as a sum of orthogonal functions 
The coefficients Y, of the various terms can be considered as coordinates in 
an infinite dimensional “function space.”’ In this space each function cor 
responds to precisely one point and each point to one function. 

A function can be considered to be substantially limited to a time T if all 
the ordinates Y,, outside this interval of time are zero. In this case all but 
2TW of the coordinates will be zero. Thus functions limited to a band Ii 
and duration T correspond to points in a space of 271 dimensions. 

A subset of the functions of band HW and duration T corresponds to a re- 
gion in this space. For example, the functions whose total energy is less 


on time series. This work, although chiefly concerned with the linear prediction and 
filtering problem, is an important collateral reference in connection with the present paper 
We may also refer here to Wiener’s forthcoming book “Cybernetics”? dealing with the 
general problems of communication and control. 

5 For a proof of this theorem and further discussion see the author’s paper ‘“‘Cornmuni 
cation in the Presence of Noise’? to be published in the Proceedings of the Institute of Radio 
Engineers. 
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than or equal to & correspond to points in a 27W dimensional sphere \ 
radius r = ~/2WE. 

An ensemble of functions of limited duration and band will be represen 
by a probability distribution p(a, --- x,) in the corresponding n dimensio 
space. If the ensemble is not limited in time we can consider the 2711 
ordinates in a given interval 7 to represent substantially the part of 1 
function in the interval 7 and the probability distribution p(x, --- 
to give the statistical structure of the ensemble for intervals of that duratic 


20. ENTROPY OF A CONTINUOUS DISTRIBUTION 
The entropy of a discrete set of probabilities p; , --- p, has been detined a 
n= -> pi log pi . 


In an analogous manner we define the entropy of a continuous distributio 
with the density distribution function p(«) by: 


oc 
H=- [ p(x) log p(x) dx 
“ eo 
With an » dimensional distribution p(x, ,--- ,x,) we have 
H = - | te P(x: ++ xn) log play, +++, Xn) dxy- ++ dx,. 


If we have two arguments x and y (which may themselves be multi-dimen 
sional) the joint and conditional entropies of p(x, y) are given by 


H(x, y) = -[f p(x, vy) log p(x, y) dx dy 


and 
> fad (x ‘ 
HAy) = — | | p(x, vy) log sont dx dy 
H,(x) = - | | p(x, y) log ? ae dx dy 
where 


pix) = [ vx, y) dy 


p(y) = [ x, y) dx. 


The entropy of continuous distributions have most (but not all) of th 
properties of the discrete case. In particular we have the following: 
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1. If xis limited to a certain volume ¢ in its space, then //(.x) is a maximum 


' | : 
and equal to log v when p(..) is constant ( in the volume 


2. With any two variables x, vy we have 


H(x, vy) < H(x) + HW 


with equality if (and only if) x and y are independent, i.e., p(x, 4 p 


p(y) (apart possibly from a set of points of probability zero 


3. Consider a generalized averaging operation of the following type: 


“ 


p’(y) = ax, v) p(x) dx 


with 


| a(x, vy) dx = a(x, y) dy ‘; a(x, v 0 


. . 


Phen the entropy of the averaged distribution p’(y) is equal to or greater 
than that of the original distribution p(.) 


1. We have 


H(x, vy) = A(x) + HAy) = Hy) + Ala 


and 


Hy) < H(y). 


5. Let p(x) be a one-dimensional distribution. The form of p(x) giving a 


maximum entropy subject to the condition that the standard deviation 
of x be fixed at o is gaussian. To show this we must maximize 


Hiz) = —| p(x) log p(x) da 


with 


- 


o = p(x)x dx and 1 p(x) dx 
as constraints. This requires, by the calculus of variations, maximizing 
| [— p(x) log p(x) + Ap(x)x” + ppl) de. 


The condition for this is 
—1 — log p(x) + Ax + 4 = 0 


and consequently (adjusting the constants to satisfy the constraints) 
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Similarly in 2 dimensions, suppose the second order moments 
p(x, °*:> ,%,) are fixed at A 


y | ae | Re Xi Px, °° , Xn) days +> dx, . 


Then the maximum entropy occurs (by a similar calculation) whe: 
p(x, ++: ,¥,) is the nv dimensional gaussian distribution with the sec 
order moments 1, 

6. The entropy of a one-dimensional gaussian distribution whose standar 


deviation is o is given by 
H(x) = log \ 2rec. 


This is calculated as follows: 
(x) 1 ss 
P(x) = \ ¢ 
xc 
—log p(x) = log YW2re + 202 


H(x) = - [ p(x) log p(x) dx 


= p(x) log W/ 20 dx + [ p(x) s , ax 


20? 


3 
2c” 


= log Vv Qno + 


= log VY2ro + log Ve 
= log V2rec. 


Similarly the » dimensional gaussian distribution with associated 
quadratic form aj; is given by 
ai; 


—— exp (— }2a,; X; Xj) 
(2a)! : 


DA Rp + * = a) 


and the entropy can be calculated as 
H = log (27e)" F | ai; |’ 
where | aj; | is the determinant whose elements are a;; 
7. If x is limited to a half line (p(x) = 0 for x < 0) and the first moment of 
x is fixed at a: 


a - | p(x)x dx, 








| 
i 
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ments 5 then the maximum entropy occurs when 
1 
P(x) = e 
ad 
and is equal to log ea. 
n) whe s. There is one important difference between the continuous and discrete 
1e secon, entropies. In the discrete case the entropy measures in an absolule 
way the randomness of the chance variable. In the continuous case the 
standar measurement ts relative to the coordinate system. If we change coordinates 
the entropy will in general change. In fact if we change to coordinates 
v-c+* ¥, the new entropy is given by 
. J x x 
H(y) = oes E(x +++ 4, J ( log P(x s+: XI dy, dy 
‘ d v y 
where J = is the Jacobian of the coordinate transformation. On ex 
panding the logarithm and changing variables to x; --- x, , we obtain: 

H(y) = H(x) - vee Pim, °°* , ta) log J (*) an +++ dx 

d y 
Thus the new entropy is the old entropy less the expected logarithm of 
the Jacobian. In the continuous case the entropy can be considered a 
measure of randomness relative /o an assumed standard, namely the co- 
ordinate system chosen with each small volume element dx; --- dx, given 
equal weight. When we change the coordinate system the entropy in 
the new system measures the randomness when equal volume elements 
dy, --+ dy, in the new system are given equal weight. 

In spite of this dependence on the coordinate system the entropy 

ociated concept is as important in the continuous case as the discrete case. This 
is due to the fact that the derived concepts of information rate and 
channel capacity depend on the difference of two entropies and_ this 
difference does not depend on the coordinate frame, each of the two terms 
being changed by the same amount. 

The entropy of a continuous distribution can be negative. The scale 
of measurements sets an arbitrary zero corresponding to a uniform dis 
tribution over a unit volume. A distribution which is more confined than 
this has less entropy and will be negative. The rates and capacities will, 

nent of however, always be non-negative. 
9. A particular case of changing coordinates is the linear transformation 
V= _ a 
; 
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In this case the Jacobian is simply the determinant | a;; |~' and 
H(y) = H(x) + log | a;; 


In the case of a rotation of coordinates (or any measure preserving trans 
formation) J = land H/(y) = H(x). 


21. ENTROPY OF AN ENSEMBLE OF FUNCTIONS 


Consider an ergodic ensemble of functions limited to a certain band 
width HW’ cycles per second. Let 


p(w ++ Xn) 

be the density distribution function for amplitudes 1, --- x, at m successive 
sample points. We define the entropy of the ensemble per degree of fre: 
dom by 


: ar F 
H —Lim =. ia plxy- ++ Xn) log pla, +++, Xn) dx +++ dx, 
We may also define an entropy H7 per second by dividing, not by n, but by 
the time T in seconds for n samples. Since n = 2TW, H’ = 2H. 


With white thermal noise p is gaussian and we have 
H’ = log V 2reN, 
H = W log 2reN. 


For a given average power .V, white noise has the maximum possible 
entropy. This follows from the maximizing properties of the Gaussia: 
distribution noted above. 

The entropy for a continuous stochastic process has many properties 
analogous to that for discrete processes. In the discrete case the entropy 
was related to the logarithm of the probability of long sequences, and to th 
number of reasonably probable sequences of long length. In the continuous 
case it is related in a similar fashion to the logarithm of the probabili!) 
density for a long series of samples, and the volume of reasonably high prob 
ability in the function space. 

More precisely, if we assume p(x; --~ x,) continuous in all the «; for all » 
then for sufficiently large n 


log p — All< 


~~. € 
iN 


for all choices of (4, °°: , *,) apart from a set whose total probability is 


less than 6, with 6 and ¢ arbitrarily small. This follows from the ergodi 
property if we divide the space into a large number of small cells. 
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The relation of 17 to volume can be stated as follows: Under the same as- 
mptions consider the 7 dimensional space correspondin, 





Let V,.(g) be the smallest volume in this space which it 
. total probability g. Then 


log 1,.(q) 


H’ 


Lim 
n—o ot i 
provided g does not equal 0 or 1. 
These results show that for large 7 there is a rather well-detined volume (at 


east in the logarithm sense) of high probability, and that within 





sense). 


In the white noise case the distribution function is given by 


1 . 
(a, --+4,) = exp — Dx; 
p 1 n/ (QxN yn/2 I IN i 
Since this depends only on 2x; the surfaces of equal probability density 
are spheres and the entire distribution has spherical symmetry. The region 
of high probability is a sphere of radius nN. As n—> « the probability 
of being outside a sphere of radius ¥/ n(N + €) approaches zero and — times 
the logarithm of the volume of the sphere approaches log y 27eN. 

In the continuous case it is convenient to work not with the entropy // of 
an ensemble but with a derived quantity which we will call the entropy 
power. This is defined as the power in a white noise limited to the same 
band as the original ensemble and having the same entropy. In other words 
if H’ is the entropy of an ensemble its entropy power is 

1 — 

V; = exp 2H . 

2re 
In the geometrical picture this amounts to measuring the high probability 
volume by the squared radius of a sphere having the same volume. Since 
white noise has the maximum entropy for a given power, the entropy power 
of any noise is less than or equal to its actual power 
21. Entropy Loss in LINEAR FILTERS 

Theorem 14: If an ensemble having an entropy //; per degree of freedom 
in band W is passed through a filter with characteristic }(/) the output 
ensemble has an entropy 


H, = H,+ > | log | ¥(f) | df. 
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The operation of the filter is essentially a linear transformation of « 
ordinates. If we think of the different frequency components as the origin 
coordinate system, the new frequency components are merely the old ones 
multiplied by factors. The coordinate transformation matrix is thus e 


TABLE I 





























ENTROPY | ENTROPY 
GAIN POWER | POWER GAIN IMPULSE RESPONSE 
FACTOR |IN DECIBELS 
2 
sin* rt 
- 8.68 ar 
(7t) 
es 2|SINt _ cost 
| 
| 
=% 
_— ¢| cost - cost, swt] 
L. @ 2t2 A J 
— +—_—— — —_—__— — | 
mr JS; (t) | 
- 2.66 : oe 
| 
| -s6sa | —~ [cos (-ayt-cost| 
} at 
| | 
| 
| 








sentially diagonalized in terms of these coordinates. The Jacobian of the 
transformation is (for ” sine and n cosine components) 


J=T[\y(? 
i—1 
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where the f; are equally spaced through the band HW’. This becomes in 
the limit 


exp 


[ log | V(/) |’ df. 


W Jw 


Since J is constant its average value is this same quantity and applying the 
theorem on the change of entropy with a change of coordinates, the result 
follows. We may also phrase it in terms of the entropy power. Thus if 
the entropy power of the first ensemble is .V, that of the second is 


| log | V(f) |" df. 
“u 


Ny exp iW 
The final entropy power is the initial entropy power multiplied by the geo- 
metric mean gain of the filter. If the gain is measured in db, then the 
output entropy power will be increased by the arithmetic mean dd gain 
over W. 

In Table I the entropy power loss has been calculated (and also expressed 
in db) for a number of ideal gain characteristics. The impulsive responses 
of these filters are also given for 7 = 22, with phase assumed to be 0. 


The entropy loss for many other cases can be obtained from these results. 
' ; P 
For example the entropy power factor —, for the tirst case also applies to any 
e 


gain characteristic obtained from 1 — w by a measure preserving transforma- 
tion of the w axis. In particular a linearly increasing gain G(w) = , or a 
“saw tooth” characteristic between 0 and 1 have the same entropy loss. 
rhe reciprocal gain has the reciprocal factor. Thus — has the factor ¢ 
me 
Raising the gain to any power raises the factor to this power. 
22. ENTROPY OF THE SUM OF TWo ENSEMBLES 
If we have two ensembles of functions fa(/) and g3(/) we can form a new 
ensemble by ‘addition.’ Suppose the first ensemble has the probability 
density function p(x, +--+, x,) and the second g(a, ++: , x). Then the 
density function for the sum is given by the convolution: 
Ht 882 oe) = cee PAV, °° * » Val 
G(X; — V5 °** Xn — Yn) dy, dye, ---, dy, 
Physically this corresponds to adding the noises or signals represented by 


the original ensembles of functions. 
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The following result is derived in Appendix 6. 

Theorem 15: Let the average power of two ensembles be N, and Nz a: 
let their entropy powers be N; and N,. Then the entropy power of t 
sum, N;, is bounded by 


N3 ee N, + Nz . 


Ni, + Nz 


lA 


White Gaussian noise has the peculiar property that it can absorb an 
other noise or signal ensemble which may be added to it with a resultant 
entropy power approximately equal to the sum of the white noise power an: 
the signal power (measured from the average signal value, which is normal|; 
zero), provided the signal power is small, in a certain sense, compared to 
the noise. 

Consider the function space associated with these ensembles having » 
dimensions. The white noise corresponds to a spherical Gaussian distribu 
tion in this space. The signal ensemble corresponds to another probability 
distribution, not necessarily Gaussian or spherical. Let the second moments 
of this distribution about its center of gravity be a,;;. That is, if 


p(a, ++: , X,) is the density distribution function 
ay = [+ [ ple — alas — a) de, +++, diy 


where the a; are the coordinates of the center of gravity. Now a,; is a posi 
tive definite quadratic form, and we can rotate our coordinate system to 
align it with the principal directions of this form. aj; is then reduced to 
diagonal form };;. We require that each 6;; be small compared to N, the 
squared radius of the spherical distribution. 

In this case the convolution of the noise and signal produce a Gaussian 
distribution whose corresponding quadratic form is 


N+ by. 
The entropy power of this distribution is 


[1N + 0,4)]" 
or approximately 


= ((N)" + 5),(N)"¢}" 


N + : Tdi. 
n 


The last term is the signal power, while the first is the noise power. 
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MATHEMATICAL THEORY OF COMMUNICATION 
PART IV: THE CONTINUOUS CHANNEL 


23. THe CAPACITY OF A CONTINUOUS CHANNEI 


In a continuous channel the input or transmitted signals will be con- 
tinuous functions of time f(/) belonging to a certain set, and the output or 


received signals will be perturbed versions of these. We will consider only 
the case where both transmitted and received signals are limited to a certain 


band W. 


their statistical structure by 


They can then be specified, for a time 7, by 2TW numbers, and 
finite dimensional distribution functions 
[hus the statistics of the transmitted signal will be determined by 


Play, <-> , Xe) = Pts) 
and those of the noise by the conditional probability distribution 


Payyesan(Mry °° Ya) = Pay). 


The rate of transmission of information for a continuous channel is defined 
in a way analogous to that for a discrete channel, namely 
R= H(x) — H,(x) 
where H(x) is the entropy of the input and H/,(x) the equivocation. The 
channel capacity C is defined as the maximum of R when we vary the input 


over all possible ensembles. This means that in a finite dimensional ap- 


proximation we must vary P(x) = P(x, +> , x.) and maximize 
a “ P( x, y) 
— P(x) log P(x) dx + [ P(x, vy) log — dx dy. 
d PF . P(y) 


his can be written 
P(x, y) 


| | P(x, vy) log Pla) Pl z= dx dy 


o 


using the fact that [ Po, y) log P(x) dx dy = [ vi r) log P(x) dx. The 


J 


channel capacity is thus expressed 


C = Lim Max 7, [ [ P(x, y) log 9, ae dy. 


(x) P(y) 


T—-0 P(z) 
It is obvious in this form that R and C are independent of the coordinate 


P(x, v) 
system since the numerator and denominator in log 


© P(x) Ply 


plied by the same factors when x and y are transformed in any one to one 


will be multi 


way. This integral expression for C is more general than H(x) — H,(. 
H,(x) 


Properly interpreted (see Appendix 7) it will always exist while H (x) 
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may assume an indeterminate form = x insome cases. This occurs, 
example, if x is limited to a surface of fewer dimensions than » in its 7 dim 
sional approximation. 

If the logarithmic base used in computing //(v) and //,(x) is two then 
is the maximum number of binary digits that can be sent per second over t! 
channel with arbitrarily small equivocation, just as in the discrete cas: 
This can be seen physically by dividing the space of signals into a large nur 
ber of small cells, sufficiently small so that the probability density P,! 
of signal x being perturbed to point y is substantially constant over a c¢ 
(either of x ory). If the cells are considered as distinct points the situatio: 
is essentially the same as a discrete channel and the proofs used there wi 
apply. But it is clear physically that this quantizing of the volume into 
individual points cannot in any practical situation alter the final answer 
significantly, provided the regions are sufficiently small. Thus the capacity 
will be the limit of the capacities for the discrete subdivisions and this is 
just the continuous capacity detined above. 

On the mathematical side it can be shown first (see Appendix 7) that if 1 
is the message, x is the signal, y is the received signal (perturbed by noise 
and v the recovered message then 


H (x) H,(x) > H(u) H (a) 


regardless of what operations are performed on uv to obtain x or on y to obtain 
Thus no matter how we encode the binary digits to obtain the signal, or 
how we decode the received signal to recover the message, the discrete rate 
for the binary digits does not exceed the channel capacity we have defined. 
On the other hand, it is possible under very general conditions to find a 
coding system for transmitting binary digits at the rate C with as small an 
equivocation or frequency of errors as desired. This is true, for example, if, 
when we take a finite dimensional approximating space for the signal func- 
tions, P(x, vy) is continuous in both « and y except at a set of points of prob 
ability zero. 
An important special case occurs when the noise is added to the signal 
and is independent of it (in the probability sense). Then ?,(y) is a function 
only of the difference n = (y — x), 


PAy) = Oly — 2) 


and we can assign a definite entropy to the noise (independent of the sta- 
tistics of the signal), namely the entropy of the distribution Q(x). This 
entropy will be denoted by H(n). 

Theorem 16: If the signal and noise are independent and the received 
signal is the sum of the transmitted signal and the noise then the rate of 
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ransmission 1s 


R= H(y) — Hin 


i.e., the entropy of the received signal less the entropy of the noise. The 


channel capacity is 


C = Max Ay) — A(n). 


I 
We have, since y = x + 7: 
H(x, vy) = H(x, n). 
Expanding the left side and using the fact that x and » are independent 
H(y) + H,(x) = H(x) + H(n). 
Hence 
R = H(x) — H,(x) = H(y) — H(n). 

Since H(n) is independent of P(x), maximizing R requires maximizing 
H(y), the entropy of the received signal. If there are certain constraints on 
the ensemble of transmitted signals, the entropy of the received signal must 
be maximized subject to these constraints. 

24. CHANNEL CAPACITY WITH AN AVERAGE POWER LIMITATION 


A simple application of Theorem 16 is the case where the noise is a white 
thermal noise and the transmitted signals are limited to a certain average 
power P. Then the received signals have an average power P + .V where 
NV is the average noise power. The maximum entropy for the received sig- 
nals occurs when they also form a white noise ensemble since this is the 
greatest possible entropy for a power P + .V and can be obtained by a 
suitable choice of the ensemble of transmitted signals, namely if they form a 
white noise ensemble of power P. The entropy (per second) of the re 


ceived ensemble is then 
H(y) = W log 2re(P + N), 
and the noise entropy is 
Hin) = W log dreN. 


The channel capacity is 


“7 


| i 
C = My) - 1) = W log 
(y) — H(n) Nv 


y 


Summarizing we have the following: 
Theorem 17: The capacity of a channel of band W perturbed by white 
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thermal noise of power .V when the average transmitter power is P is given 


P+ N 


C = W log 


This means of course that by sufficiently involved encoding systems \ 


ete a . P+N.,. : 
can transmit binary digits at the rate W logs -— bits per second, wit 


N 
arbitrarily small frequency of errors. It is not possible to transmit at 
higher rate by any encoding system without a definite positive frequency « 
errors. 

» To approximate this limiting rate of transmission the transmitted signals 
must approximate, in statistical properties, a white noise.’ A system whic! 
approaches the ideal rate may be described as follows: Let = 2° samples 
of white noise be constructed each of duration 7. These are assigned 
binary numbers from 0 to (M — 1). At the transmitter the message se- 
quences are broken up into groups of s and for each group the corresponding 
noise sample is transmitted as the signal. At the receiver the M samples are 
known and the actual received signal (perturbed by noise) is compared with 
each of them. The sample which has the least R.M.S. discrepancy from the 
received signal is chosen as the transmitted signal and the corresponding 
binary number reconstructed. This process amounts to choosing the most 
probable (a posteriori) signal. The number M of noise samples used will 
depend on the tolerable frequency e of errors, but for almost all selections oi 
samples we have 


Lim Lim = W log 


e—0 T+ + 


log M(e, T) P+N 
- 


so that no matter how small ¢€ is chosen, we can, by taking T sufficiently 


) N 
large, transmit as near as we wish to TW log V binary digits in the 
time 7. 
, Tr ; , P+WN , 
Formulas similar to C = W log _ for the white noise case have 


been developed independently by several other writers, although with some- 
what different interpretations. We may mention the work of N. Wiener,’ 
W. G. Tuller, and H. Sullivan in this connection. 

In the case of an arbitrary perturbing noise (not necessarily white thermal 


noise) it does not appear that the maximizing problem involved in deter- 


6 This and other properties of the white noise case are discussed from the geometrical 
point of view in ‘Communication in the Presence of Noise,’’ loc. cit. 

7““Cybernetics,”’ loc. cit. 

8Sc. D. thesis, Department of Electrical Engineering, M.I.T., 1948. 
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mining the channel capacity C can be solved explicitly. However, upper 
id lower bounds can be set for C in terms of the average noise power .V 


and the noise entropy power .V,. These bounds are sufficiently close to- 
; 1 ; 


vether in most practical cases to furnish a satisfactory solution to the 
problem. 
Theorem 18: The capacity of a channel of band W perturbed by an arbi- 


trary noise is bounded by the inequalities 


ot P+wN 
i log r : <C < W log - 
iV] 4 
where 
P = average transmitter power 
N = average noise power 


N, = entropy power of the noise. 

Here again the average power of the perturbed signals will be P + NV. 
The maximum entropy for this power would occur if the received signal 
were white noise and would be W log 2re(P + N). It may not be possible 
to achieve this; i.e. there may not be any ensemble of transmitted signals 
which, added to the perturbing noise, produce a white thermal noise at the 
receiver, but at least this sets an upper bound to H(y). We have, therefore 


C = max H(y) — H(n) 
< W log 2re(P + N) — W log 2reN,. 


This is the upper limit given in the theorem. The lower limit can be ob 
tained by considering the rate if we make the transmitted signal a white 
noise, of power P. In this case the entropy power of the received signal 
must be at least as great as that of a white noise of power P + N; since we 
have shown in a previous theorem that the entropy power of the sum of two 
ensembles is greater than or equal to the sum of the individual entropy 
powers. Hence 
max H(y) > W log 2re(P + Nj) 
and 
C > W log 2re(P + Ni) — W log 2reN, 


P + Ny 


W log >. 


As P increases, the upper and lower bounds approach each other, so we 
have as an asymptotic rate 
i z N 
W log - 


‘ f 
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If the noise is itself white, = .V, and the result reduces to the formu! 


proved previously: 
P 
C= Wl 1 ~~ - 
og ( + r) 


If the noise is Gaussian but with a spectrum which is not necessarily flat 
.V, is the geometric mean of the noise power over the various frequencies in 
the band WW. Thus 


N, = exp = log V(f) df 
HW Ww F 


where .V(f) is the noise power at frequency /. 
Theorem 19; If we set the capacity for a given transmitter power 7? 
equal to 
P+N-—y 
M, 
then 7 is monotonic decreasing as P increases and approaches 0 as a limit. 
Suppose that for a given power P; the channel capacity is 
is Tas N = vi 


4¥1 


C = W log 


W log 


This means that the best signal distribution, say p(x), when added to the 
noise distribution g(x), gives a received distribution r(y) whose entropy 
power is (P; + V — m). Let us increase the power to P; + AP by adding 
a white noise of power AP to the signal. The entropy of the received signal 
is now at least 


H(y) = W log 2re(P, + N — m + AP) 


by application of the theorem on the minimum entropy power of a sum. 
Hence, since we can attain the // indicated, the entropy of the maximizing 
distribution must be at least as great and » must be monotonic decreasing. 
To show that 7 > 0 as P — ~ consider a signal which is a white noise with 
a large P. Whatever the perturbing noise, the received signal will be 
approximately a white noise, if ? is sufficiently large, in the sense of having 
an entropy power approaching P + N, 


25. THe CHANNEL CAPACITY WITH A PEAK POWER LIMITATION 


In some applications the transmitter is limited not by the average power 
output but by the peak instantaneous power. The problem of calculating 
the channel capacity is then that of maximizing (by variation of the ensemble 
of transmitted symbols) 


H(y) — H(n) 
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subject to the constraint that all the functions f(/) in the ensemble be less 
than or equal to VS, say, for all ¢. A constraint of this type does not work 


out as well mathematically as the average power limitation. The most we 


. . - S . * 
have obtained for this case is a lower bound valid for all yy? an asymptotic 


oF S ; aE ae , 
upper band | valid for large =) and an asymptotic value of C for \ smal] 


Theorem 20: The channel capacity C for a band HW" perturbed by white 


thermal noise of power .V is bounded by 


C > W lba— —., 
~ ae? N 


where S is the peak allowed transmitter power. For suthiciently large 


) 
—~ §+N 
Te 


C < 'W log (1 + e) 


; ; S 
where € is arbitrarily small. As =e 0 (and provided the band W starts 


ee ee 
( on ( + *) 


We wish to maximize the entropy of the received signal. If y is large 


at 0) 


this will occur very nearly when we maximize the entropy of the trans 
mitted ensemble. 

The asymptotic upper bound is obtained by relaxing the conditions on 
the ensemble. Let us suppose that the power is limited to S not at every 
instant of time, but only at the sample points. The maximum entropy of 
the transmitted ensemble under these weakened conditions is certainly 
greater than or equal to that under the original conditions. This altered 
problem can be solved easily. The maximum entropy occurs if the different 
samples are independent and have a distribution function which is constant 
from —VYVSto+7yS. The entropy can be calculated as 


W log 48, 
The received signal will then have an entropy less than 


W log (485 + 2weN)(1 + €) 








644 BELL SYSTEM TECHNICAL JOURNAL 


with e— 0 as — — « and the channel capacity is obtained by subtractin; 


the entropy of the white noise, W log 27eN 
2 
S+N 
W log (48 + 2reN)(1 + €) — W log (27eN) = W log 7 y (1+. 


This is the desired upper bound to the channel capacity. 

To obtain a lower bound consider the same ensemble of functions. Let 
these functions be passed through an ideal filter with a triangular transfer 
characteristic. The gain is to be unity at frequency 0 and decline linearly 
down to gain 0 at frequency 1”. We first show that the output functions 
of the filter have a peak power limitation S at all times (not just the sample 

sin 2rWt 


joints). First we note that a pulse - 
poe 2nWt 


going into the filter produces 
1 sin’ Wt 
2 (xWt)? 


in the output. This function is never negative. The input function (in 
the general case) can be thought of as the sum of a series of shifted functions 


sin 2rlVt 
1 = 
"On We 


where a, the amplitude of the sample, is not greater than y S. Hence the 
output is the sum of shifted functions of the non-negative form above with 
the same coefficients. These functions being non-negative, the greatest 
positive value for any / is obtained when all the coefficients a have their 
maximum positive values, i.e. 1S. In this case the input function was a 
constant of amplitude +/S and since the filter has unit gain for D.C., the 
output is the same. Hence the output ensemble has a peak power S. 

The entropy of the output ensemble can be calculated from that of the 
input ensemble by using the theorem dealing with such a situation. The 
output entropy is equal to the input entropy plus the geometrical mean 
gain of the filter; 


Ww W " “as 
[ log G’ df = | ion (" WV . df = —2W 


a 


Hence the output entropy is 


45 
W log 45 — 2W = W log ~ 


¢* 
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and the channel capacity is greater than 


AS 
W log N° 
ree ] 


: ‘ ‘ By : 
We now wish to show that, for small V (peak signal power over average 


white noise power), the channel capacity is approximately 


C Wl (1 :) 
= og . 
ig a N 


. eo a S as ‘ 
More precisely C/W log (1 + 4 — las v >Q. Since the average signal 


power P is less than or equal to the peak S, it follows that for all V 


C<W log (1 + ) <i log (1 re +): 


Therefore, if we can find an ensemble of functions such that they correspond 
to a rate nearly W log (1 + *) and are limited to band W and peak S the 


result will be proved. Consider the ensemble of functions of the following 
type. A series of / samples have the same value, either ++W/S or — V5, 
then the next / samples have the same value, etc. The value for a series 
is chosen at random, probability } for ++/S and 5 for —yVS_ If this 
ensemble be passed through a filter with triangular gain characteristic (unit 
gain at D.C.), the output is peak limited to +S. Furthermore the average 
power is nearly S and can be made to approach this by taking ¢ sufficiently 
large. The entropy of the sum of this and the thermal noise can be found 
by applying the theorem on the sum of a noise and a small signal. This 
theorem will apply if 


ht 


N 


is sufficiently small. This can be insured by taking y small enough (after 


tischosen). The entropy power will be S + NV to as close an approximation 
as desired, and hence the rate of transmission as near as we wish to 


W log ¢ us =) 
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PART V: THE RATE FOR A CONTINUOUS SOURCE 
20. Fipetity EVALUATION FUNCTIONS 


In the case of a discrete source of information we were able to determine a 
detinite rate of generating information, namely the entropy of the under- 
lying stochastic process. With a continuous source the situation is con 
siderably more involved. In the first place a continuously variable quantity 
can assume an infinite number of values and requires, therefore, an infinite 
number of binary digits for exact specification. This means that to transmit 
the output of a continuous source with exac/ recovery at the receiving point 
requires, in general, a channel of infinite capacity (in bits per second) 
Since, ordinarily, channels have a certain amount of noise, and therefore a 
finite capacity, exact transmission is impossible. 

This, however, evades the real issue. Practically, we are not interested 
in exact transmission when we have a continuous source, but only in trans- 
mission to within a certain tolerance. The question is, can we assign a 
definite rate to a continuous source when we require only a certain fidelity 
of recovery, measured in a suitable way. Of course, as the fidelity require- 
ments are increased the rate will increase. It will be shown that we can, in 
very general cases, detine such a rate, having the property that it is possible, 
by properly encoding the information, to transmit it over a channel whose 
capacity is equal to the rate in question, and satisfy the fidelity requirements. 
A channel of smaller capacity is insufficient. 

It is first necessary to give a general mathematical formulation of the idea 
of fidelity of transmission. Consider the set of messages of a long duration, 
say T seconds. The source is described by giving the probability density, 
in the associated space, that the source will select the message in question 
P(x). A given communication system is described (from the external point 
of view) by giving the conditional probability P.(y) that if message x is 
produced by the source the recovered message at the receiving point will 
be y. The system as a whole (including source and transmission system) 
is described by the probability function P(x, y) of having message x and 
final output vy. If this function is known, the complete characteristics of 
the system from the point of view of fidelity are known. Any evaluation 
of fidelity must correspond mathematically to an operation applied to 
P(x, y). This operation must at least have the properties of a simple order- 
ing of systems; i.e. it must be possible to say of two systems represented by 
P\(x, vy) and P(x, y) that, according to our fidelity criterion, either (1) the 
first has higher fidelity, (2) the second has higher fidelity, or (3) they have 
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equal fidelity. This means that a criterion of fidelity can be represented by 


a numerically valued function: 
o( P(x, y) 


whose argument ranges over possible probability functions P(x, \ 

We will now show that under very general and reasonable assumptions 
the function v(P(x, y)) can be written in a seemingly much more specialized 
form, namely as an average of a function p(x, y) over the set of possible values 
of x and y: 

(P(x, v)) [ P(x, v) p(x, y) dx dy 
To obtain this we need only assume (1) that the source and system are 
ergodic so that a very long sample will be, with probability nearly 1, typical 
of the ensemble, and (2) that the evaluation is ‘‘reasonable” in the sense 
that it is possible, by observing a typical input and output x; and y,, to 
form a tentative evaluation on the basis of these samples; and if these 
samples are increased in duration the tentative evaluation will, with proba- 
bility 1, approach the exact evaluation based on a full knowledge of P(x, y). 
Let the tentative evaluation be p(x, vy). Then the function p(x, y) ap- 
proaches (as T — =x) a constant for almost all (x, v) which are in the high 


probability region corresponding to the system: 
p(x, v) — 2( P(x, y)) 


and we may also write 


pix, ¥) = P(x, y)p(x, y) dx, dy 


since 


P(x, y)dxdy = 1 
This establishes the desired result. 

The function p(x, vy) has the general nature of a “distance” between x 
and y. It measures how bad it is (according to our fidelity criterion) to 
receive y when x is transmitted. The general result given above can be 
restated as follows: Any reasonable evaluation can be represented as an 
average of a distance function over the set of messages and recovered mes- 
sages x and y weighted according to the probability P(x, v) of getting the 
pair in question, provided the duration 7 of the messages be taken suth- 
ciently large. 

9It is not a ‘‘metric’’ in the strict sense, however, since in general it does not satis! 
either p(x, v) = p(y, x) or p(x, vy) + ply, 2) > p(x, z 
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The following are simple examples of evaluation functions: 
1. R.M.S. Criterion. 


= (x(t) — y(t)” 


In this very commonly used criterion of fidelity the distance functi: 
p(x, vy) is (apart from a constant factor) the square of the ordinar 
euclidean distance between the points x and y in the associated functio 


space. ( 


® ig , 
p(x, y) = 7 i [x(t) — y(2)]}° dt 


i) 


Frequency weighted R.M.S. criterion. More generally one can apply 
different weights to the different frequency components before using an 
R.M.S. measure of fidelity. This is equivalent to passing the difference 
x(t) — y(t) through a shaping filter and then determining the average 
power in the output. Thus let 


e(t) = x(t) — y(t) 


and 
So = e(r)k(t —r)dt 
then 
17. : 
p(x, y) = T i f(y’ dt. 


3. Absolute error criterion. 


a 


eo id 
o(x, 9) = 2 [jay — yo [ae 
0 


4. The structure of the ear and brain determine implicitly an evaluation, or 
rather a number of evaluations, appropriate in the case of speech or music 
transmission. There is, for example, an “‘intelligibility” criterion in 
which p(x, y) is equal to the relative frequency of incorrectly interpreted 
words when message x(/) is received as y(/). Although we cannot give 
an explicit representation of p(x, y) in these cases it could, in principle, 
be determined by sufficient experimentation. Some of its properties 
follow from well-known experimental results in hearing, e.g., the ear is 
relatively insensitive to phase and the sensitivity to amplitude and fre- 





quency is roughly logarithmic. 
5. The discrete case can be considered as a specialization in which we have 
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tacitly assumed an evaluation based on the frequency of errors. The 
function p(x, y) is then defined as the number of symbols in the sequence 
y differing from the corresponding symbols in x divided by the total num 
ber of symbols in x. 


27. THE RATE FOR A SOURCE RELATIVE TO A FIDELITY EVALUATION 


We are now in a position to define a rate of generating information for a 
continuous source. We are given P(x) for the source and an evaluation 
determined by a distance function p(x, vy) which will be assumed continuous 
inboth.rand y. Witha particular system P(x, y) the quality is measured by 


,= | p(x, vy) P(x, y) dx dy 


ve 


Furthermore the rate of flow of binary digits corresponding to P(x, y) is 


P(x, y) 
K = I P(x, v) log (Oe OY. 
: waaay v)P(y) | i 


We define the rate R; of generating information for a given quality 7 of 
reproduction to be the minimum of R when we keep 2 fixed at 2 and vary 
Py). That is: 


e ) 


Ry = Min I] P(x, y) log c 


~x)P(y) 


(x, y) 


dx dy 


Py(u) 


subject to the constraint: 


y= I| P(x, y)p(x, y) dx dy. 

This means that we consider, in effect, all the communication systems that 
might be used and that transmit with the required fidelity. The rate of 
transmission in bits per second is calculated for each one and we choose that 
having the least rate. This latter rate is the rate we assign the source for 
the fidelity in question. 

The justification of this definition lies in the following result: 

Theorem 21: If a source has a rate R; for a valuation 7 it is possible to 
encode the output of the source and transmit it over a channel of capacity ¢ 
with fidelity as near 7; as desired provided Rj < C. This is not possible 
if Ri > C. 

The last statement in the theorem follows immediately from the detinition 
of R, and previous results. If it were not true we could transmit more than 
C bits per second over a channel of capacity C. The first part of the theorem 
is proved by a method analogous to that used for Theorem 11. We may, in 
the first place, divide the (x, y) space into a large number of small cells and 
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represent the situation as a discrete case. This will not change the evalu 
tion function by more than an arbitrarily small amount (when the cells a 
very small) because of the continuity assumed for p(x, y). Suppose tha 
P(x, y) is the particular system which minimizes the rate and gives R;. W 
choose from the high probability y’s a set at random containing 


y(Ri t+ e)7 


members where ¢—> 0 as 7 — x«. With large 7 each chosen point will be 
connected by a high probability line (as in Fig. 10) to a set of x’s. A calcu 
lation similar to that used in proving Theorem 11 shows that with large 7 
almost all x’s are covered by the fans from the chosen y points for almost 
all choices of the y’s. The communication system to be used operates as 
follows: The selected points are assigned binary numbers. When a message 
x is originated it will (with probability approaching 1 as 7 — «) lie within 
one at least of the fans. The corresponding binary number is transmitted 
(or one of them chosen arbitrarily if there are several) over the channel by 
suitable coding means to give a small probability of error. Since Ry < (¢ 
this is possible. At the receiving point the corresponding y is reconstructed 
and used as the recovered message. 

The evaluation 7; for this system can be made arbitrarily close to 2% by 
taking 7 sufficiently large. This is due to the fact that for each long sample 
of message x(/) and recovered message y(/) the evaluation approaches 2 
(with probability 1) 

It is interesting to note that, in this system, the noise in the recovered 
message is actually produced by a kind of general quantizing at the trans- 
mitter and is not produced by the noise in the channel. It is more or less 
analogous to the quantizing noise in P.C.M. 


28. THe CALCULATION OF RATES 


The detinition of the rate is similar in many respects to the definition of 
channel capacity. In the former 


vl P(x, y) 
R = Max I P(x, v) log : dx dy 
Pz(y) Je : ~ P(x) P(y) ; 
with P(x) and 7% = I P(x, v)p(x, vy) dx dy fixed. In the latter 
me P(e. 9) 
( Min I| P(x, y) log <—. ax dy 
P(z) ds : P(x) P(y) } 


with P,(y) tixed and possibly one or more other constraints (e.g., an average 
power limitation) of the form A = s/ P(x, y) A(x, y) dx dy. 
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A partial solution of the general maximizing problem for determining the 


rate of a source can be given. Using Lagrange’s method we consider 


: Pit, -v) -, 
I| P(x, y) log | t pw P(x, y)plx, y) + v(x) P(x, y)| da 


(xr) PI vy) 
The variational equation (when we take the first variation on P(x, \ 
leads to 
Pu(x) = Blix) < a 
where A is determined to give the required fidelity and B(x) is chosen to 


satisfy 
{ Bix)eO” de = 1 


This shows that, with best encoding, the conditional probability of a cer- 
tain cause for various received y, ?,() will decline exponentially with the 
distance function p(x, vy) between the x and y is question. 

In the special case where the distance function p(x, y) depends only on the 
(vector) difference between x and y, 

p(x, vy) = p(x — y) 
we have 


| B(x)e nell dx is 


Hence B(x) is constant, say a, and 


Np(r—y 
P,(x) = ae 


Unfortunately these formal solutions are difficult to evaluate in particular 
cases and seem to be of little value. In fact, the actual calculation of rates 
has been carried out in only a few very simple cases. 

If the distance function p(x, y) is the mean square discrepancy between 
x and y and the message ensemble is white noise, the rate can be determined. 
In that case we have 

R = Min [|H(x) — H,(x)| = H(x) — Max H,(x) 
with N = (x — y)°. But the Max H/,(x) occurs when y — x is a white noise, 
and is equal to HW, log 2re V where IV, is the bandwidth of the message en- 
semble. Therefore 
R = W, log 2reV — W, log 2reV 

Wy1 © 
= (=. = 
N 


where (Q is the average message power. This proves the following: 
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Theorem 22: The rate for a white noise source of power Q and band | 
relative to an R.M.S. measure of fidelity is 


a : QV 
R = Wy, log V 
where .V is the allowed mean square error between original and recover: 
messages. 
More generally with any message source we can obtain inequalities bound 
ing the rate relative to a mean square error criterion. 
Theorem 23: The rate for any source of band WW is bounded by 


Wilke <2 < Wi kee © 
1 og NV ee ~ 1 og N 


where () is the average power of the source, Q; its entropy power and N the 
allowed mean square error. 

The lower bound follows from the fact that the max H/,(x) for a given 
(x — y)° = N occurs in the white noise case. The upper bound results if we 
place the points (used in the proof of Theorem 21) not in the best way but 
at random in a sphere of radius VQ — N. 
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APPENDIX 5 
Let S; be any measurable subset of the g ensemble, and S_ the subset of 
the f ensemble which gives S; under the operation 7. Then 
Si — TDS 
d " e . “ 
Let H” be the operator which shifts all functions in a set by the time X. 
Then 
Lo Ano TIP o 
HS, = HTS, = TH'S: 
° mye lf . ° . » . ys 
since T is invariant and therefore commutes with H°. Hence if m|.S] is the 
probability measure of the set S 


m|HS;) = mlTH’S,] = mlH"S.] 
= mlS2] = m[S,] 
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ere the second equality is by definition of measure in the g space the 
third since the f ensemble is stationary, and the last by definition of g meas- 
ure again, 
l'o prove that the ergodic property is preserved under invariant operations, 
et S; be a subset of the g ensemble which is invariant under H", and let S, 
be the set of all functions f which transform into S;. Then 
No \mno a Lo . 
H A = H TS2 — TH So - Sy 
h , . . . . - . . 
so that HS; is included in S; forall A. Now, since 
Lo . 
ml HS] = mls, 


this implies 


h 


HS, = Ss 


for all X with m[S.] ~ 0,1. This contradiction shows that S; does not exist. 
APPENDIX 6 
The upper bound, Ns < \, + No, is due to the fact that the maximum 
possible entropy for a power VY, + V2 occurs when we have a white noise of 
this power. In this case the entropy power is V; + Vo. 
To obtain the lower bound, suppose we have two distributions in » dimen- 
sions p(x,;) and g(x,) with entropy powers VN, and Ny. What form should 


i's 


p and g have to minimize tke entropy power .N; of their convolution r(2: 
r(x) = [ plvdalx — y;) dy;. 
The entropy H; of r is given by 


H; = —| r(x;) log r(x;) dx;. 


We wish to minimize this subject to the constraints 


H, = -| t(x,) log t(x,) dx, 
HH, = = g(x) log g(x) dx; : 
We consider then 
U = -| [r(x) log r(x) + Af(x) log f(x) + ug(x) log g(x)] dx 


bU 


-| [1 + log r(x)]ér(x) + ALL + log p(x)]6¢(x) 


+ ull + log g(x)dq(x)|] dx. 
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If p(x) is varied at a particular argument x, = s,, the variation in r(. 
or(x) g(x S,) 
and 
6U = = g(x; — s,) log r(x;) dx; — d log p(s;) = 0 


and similarly when g is varied. Hence the conditions for a minimum are 


q(x; — s;) log r(x,) = —A) log p(s,) 
p(x; — s;) log r(x;) = —p log QS<). 


. 


If we multiply the first by p(s,) and the second by q(s;) and integrate wit! 
respect to s we obtain 


Hs = — X H, 
H; = —p Hy 
or solving for \ and w and replacing in the equations 
Hy q(x; — S;) log r(x;) dx; = —H,; log p(s;) 
Fis Ly. — SJE FER;z) dx; = —Tl, log p(s;). 
Now suppose p(.x,) and g(x,) are normal 
p(x) = ok exp — 32A;;xi% 
(2a )*/2 rare 
' B;; |"" 
q(x;) = ‘~*~ exp — 42B;;x;%x;. 
(2x)*/2 


Then r(x;) will also be normal with quadratic form C;;. If the inverses of 
these forms are a;;, 6;;, ¢;; then 


Ci; = Gi; + 5,;. 
We wish to show that these functions satisfy the minimizing conditions if 


and only if a;; = Ab,; and thus give the minimum //; under the constraints. 
First we have 


; n 1 i . 
log r(x;) = : log 1Caj | — 42C 4; 25%; 
2 Qn 


P ; n 1 : . ‘ 
q(x; — s;) log r(x;) = ~ log 1Cij | — $2Cijsis; — 42Ci;5;;. 
: 2 Qn . 
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his should equal 


4 bg en Pa eee? 
H; 2 2r ‘ 
ich requires «1 =e c 
which requires A,; = if 
pica i” Hs 
H = 
In this case A;; = 7 B,, and both equations reduce to identities. 


APPENDIX 7 


rhe following will indicate a more general and more rigorous approach to 
the central definitions of communication theory. Consider a probability 
measure space whose elements are ordered pairs (x, vy). The variables x, y 
are to be identified as the possible transmitted and received signals of some 
long duration 7. Let us call the set of all points whose x belongs to a subset 
S, of x points the strip over S;, and similarly the set whose y belongs to S, 
the strip over S.. We divide x and y into a collection of non-overlapping 
measurable subsets Y,; and J’; approximate to the rate of transmission R by 


1 
R, ” >» P( >, a J 3) log 


PUX;, ¥ 4) 
P(X;)P(V,) 
where 
P(X,) is the probability measure of the strip over Y 
P(Y,) is the probability measure of the strip over ] 
P(X,, 1) ,) is the probability measure of the intersection of the strips 


A further subdivision can never decrease R,;. For let NV, be divided into 


Ay = x; + ry and let 


P(V;) =a P(X;) = b+ 
P(X) = b P(X), V1) = d 
P(X7) = ¢ P(X7, ¥)) =e 


PUXs. } 1) = d + € 
Then in the sum we have replaced (for the Vy, }; intersection) 


d+e ( e 
(d + e) log by d log + ¢ log 
alb + €) : ad ad 


It is easily shown that with the limitation we have on 6, ¢, d, e, 
ite le 
d + e - de 


< 


b + c _ 64 ¢ 
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and consequently the sum is increased. Thus the various possible subdi 
sions form a directed set, with R monotonic increasing with refinement 
the subdivision. We may define R unambiguously as the least upper bou 
for the R; and write it 


R = a [| P(x, y) log Bie Be dx dy. 
2 e iid of i) at j 

This integral, understood in the above sense, includes both the continuous 
and discrete cases and of course many others which cannot be represented 
in either form. It is trivial in this formulation that if x and w are in one-to 
one correspondence, the rate from u to y is equal to that from x toy. Ii 

is any function of y (not necessarily with an inverse) then the rate from x to 
y is greater than or equal to that from x to 2 since, in the calculation of the 
approximations, the subdivisions of y are essentially a finer subdivision of 
those for 2. More generally if y and 2 are related not functionally but 
statistically, i.e., we have a probability measure space (v, v7), then R(x, v) < 
R(x, y). This means that any operation applied to the received signal, even 
though it involves statistical elements, does not increase R. 

Another notion which should be defined precisely in an abstract formu 
lation of the theory is that of “dimension rate,”’ that is the average number 
of dimensions required per second to specify a member of an ensemble. In 
the band limited case 2W numbers per second are sufficient. A general 
definition can be framed as follows. Let fa(/) be an ensemble of functions 
and let p;[fa(t), f3(4)| be a metric measuring the “distance”’’ from f, to / 
over the time 7 (for example the R.M.S. discrepancy over this interval. 
Let V(e, 6, 7) be the least number of elements f which can be chosen such 
that all elements of the ensemble apart from a set of measure 6 are within 
the distance € of at least one of those chosen. Thus we are coverinz the 
space to within € apart from a set of small measure 6. We deiine the di 
mension rate \ for the ensemble by the triple limit 


; , . log N(e, 6, T) 
\ = Lim Lim Lim —, 
50. acead Soon T log 
This is a generalization of the measure type definitions of dimension in 
topology, and azrees with the intuitive dimension rate for simple ensembles 
where the desired result is obvious. 
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Transients in Mechanical Systems 


By J. T. MULLER 


INTRODUCTION 


A study of the response of an electrical network or system to the input 
of transients in the form of short-duration pulses is an accepted method of 
analysis of the network. By comparing the input and the output, conclu- 
sions may be drawn as to the respective merit of the various components. 

Until recently similar procedures were only of academic interest with 
mechanical systems. However, the tests for mechanical ruggedness, which 
are required of electronic gear in order to pass specifications for the armed 
forces, are an example of the application of transients to a mechanical sys- 
tem. These tests are known as High Impact Shock Tests. 

A basic part of an electrical system is a damped resonant network consist- 
ing of an inductance, a capacitance and a resistance. A mass, a spring and 
a friction device is the equivalent mechanical network called a simple me 
chanical system and a combination of such networks is a general mechanical 
system. It is, of course, advantageous to keep the mechanical system as 
simple as possible without detracting from the general usefulness of the 
results obtained. 

The problems here considered are pertinent to a system which is essen- 
tially made up of a supporting structure or table and a resilient mounting 
array bearing the equipment (e.g. electronic gear) which is vulnerable to 
shock. (See Fig. 1.) 

A shock is the physical manifestation of the transfer of mechanical energy 
from one body to another during an extremely short interval of time. The 
order of magnitude of the time interval is milliseconds and quite frequently 
fractions of a millisecond. 

The system is excited by administering large spurts of mechanical energy 
to the supporting table. The manner in which this energy is supplied to the 
base and the way it is dissipated through the system are the subjects of this 
paper. 

The energy transfer to the supporting table is accomplished by the use 
of huge hammers which strike the anvil with controllable speeds. The 
action is assumed to be similar to that of an explosion, particularly to an 
underwater explosion at close range or a near-miss. As to the real compar- 
ison between the two, the reader is referred to the various manuscripts 
published by the Bureau of Ships. This particular phase of the subject is 
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considered outside the scope of this paper, except for the following brief 
statement: 

Both actions fit the definition of shock stated above and the difference be- 
tween the two is one of size and not of kind. 

Shocks are transients and are conveniently treated by a branch of math 
matics which is adapted to the solution of problems of this kind; viz, the 
Laplace Transforms, and the reader is referred to Gardner and Barnes, 
“Transients in Linear Systems.’”’ The nomenclature used here is identical 
to that of those authors. 


EQUIPMENT 
(MASS,M) 





RESILIENT 














_ MOUNTING ARRAY 
= STIFFNESS, k) 
<4 ee ( , 
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oe eA jp 
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Fig. 1—Schematic layout of shock machine. 


The manuscript consists of two parts: In the first, the energy transfer to 
the base is considered. We are dealing here with rigid bodies; consequently 
with very small transient displacements and very large forces. These are 
usually referred to as impact forces or impulses and four such functions of 
force and time are discussed. Displacements with associated velocities re- 
sult from the action of impulses on the base. 

The second part deals with the effect of these displacements on the shock- 
mounted equipment. Although the mathematical procedure is identical to 
the first part, here we deal with a function of displacement and time. There 
is no specific name for such a relationship but a suggestive term is “whip.” 
However, the pulse functions represented are the same as those of the force 
and time function. 

It is assumed that the displacement-time pulse is independent of the subse- 
quent motion of the mass. 

In considering any kind of shock problem we have the following funda- 
mental considerations: 
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First, we shall want to know the magnitude of the shock present in the 

ase or supporting structure; this will be called the “excitation.” 

Second, the behavior of the resilient medium interposed between the 
shock-producing base and the equipment. It is sometimes expressed as the 
coupling. We shall use the term transmission. 

Third, the resulting disturbance of the equipment caused by the trans 
mitted shock, which we will call the response. 

The three functions do not exist independently, but are mathematically 
related. For a clearer understanding of shock phenomena it is perhaps 
helpful to fix in one’s mind the idea that the response of a system is com- 
pletely dependent upon the transmission function. 

To use an electrical analogue, the voltage e (¢) impressed upon a system 
produces an output voltage e, (¢) which is completely defined by the trans- 
mission function. For instance, if this transmission function represents a 
filter of some kind with given boundaries, then it is to be expected that the 
response of é (¢) is completely changed outside these limits and could even 
be zero. The same train of thought will hold for mechanical systems. Here 
the transmission function is mostly represented by the stiffness or the com- 
pliance. For a completely rigid medium the stiffness would be infinite and 
the input and output would be alike; in other words, a force applied to the 
base would appear at the equipment. This is a theoretical case because no 
material is perfectly rigid. Though some materials are more rigid than 
others they will all give if the force applied is big enough. Now the forces 
associated with a shock are almost always of considerable magnitude so that 
the stiffness of a material becomes significant. 

As the stiffness diminishes the response changes and may appear to be 
quite different from the input. As far as the transmissibility of forces is 
concerned, the reader is reminded that a force is always accompanied by a 
reaction. The forces which put the base into motion cannot be transmitted 
by a soft material like rubber, unless it is compressed to extremely high 
values, and thus produce an equally large reactive force. 


PART I 
ANALYSIS OF THE EXCITATION OF THE BASE 


By recording the motions of the base, we obtain time-displacement curves 
as shown in Fig. 2. The method of recording has been done by means of 
high-speed motion pictures (at the Whippany testing laboratory using a 
Fastex) and by using strain gages (at the Annapolis Engineering Experiment 
Station). 

The test equipments are fundamentally mechanical impact producing 
machines. For technical details and description of the machines the reader 
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is referred to the various test specifications by the Bureau of Ships (as, 
example, Spec. 40T9). 

The characteristic of an impact is the transfer of mechanical energy fri 
one mass to another in a relatively short time. The corresponding force 4s 
function of time is called an impulse, henceforth indicated as F(t). A stu 
of the pulse functions has suggested some probable theoretical shapes of / 
which could cover a wide variety of conditions. These pulse functions 
be used for force-time functions as well as displacement-time-functions a1 
it will be shown that the results are surprisingly similar. 

We will let these pulses operate on the base with mass m, and calculate and 
plot the resulting time displacement curves. Since an impulse is associate 


; oes, a . : ~ my ; . rae 
with energy transfer, it must be a function of From the point of viey 
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TiMeE=—> mM- 2 MILLISECONDS 


c 


SPLACEMENT 
_— 


D 








Fig. 2—Time displacement record of medium high impact machine. 


of shock action, the final velocity v7 is extremely important, for it is this 
velocity which will determine the displacement and acceleration of the 
shock-mounted equipment. 

To distinguish the various applications of the pulse functions, the follow- 
ing notations are adopted: 

f() represents any functions of /, without reference to its dimensional 
magnitude. The transform of f(t) is indicated by F(s). 

x(t) represents a function of ¢ when it is a displacement of the ma:s m 
only. The transform is indicated by X(s). 

xi(¢) represents a function of ¢ when it is a displacement of the dase (with 
mass m,) only. The transform is indicated by Y,(s). 

F(t) represents a function of ¢ when it is a force applied to the base. The 
transform is indicated by Fo(s). 

Since x;(¢) and F(¢) are input functions, they may be represented by the 
same type pulse, in which case the transforms are alike, i.e., F(s) = Xi(s) = 
Fo(s). 

Figure 3A, a rectangular pulse, is the simplest form. 

Figure 3B is a triangular pulse, f(/), reaching a peak and returning to zero 
in a linear manner. 
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Figure 3C consists of one-half cycle of a sine wave. 
Figure 3D is a cosine pulse of one-cycle duration and is shifted along the } 


axis an amount equal to the amplitude. 
| 


These are the pulses to be used in the problems under consideration 

If they represent a force as it varies with time then it is said that / 
represents a particular pulse. The Laplace transform of F(¢) is given as 
Fy(s), Fo(s) being some function in the complex domain. It is outside the 
scope of this paper to prove or show the mathematical technique in ob 
taining the transforms which produce Fo(s). We will present them here for 


future reference. 
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Fig. 3—Four pulses 
The Laplace transform for a very short pulse is 
F(s) = A 1.01) 


and is a pulse which has a finite area but the time interval of which is 
approaching zero. 
For a square pulse with finite time interval and magnitude a (hereafter 


referred to as pulse amplitude) it is 


. nil, 
F(s) = a (1.02) 
AY 
For a triangular pulse 
be \2 
: a 1—e , 
F(s) = (1.03) 
b 5S 
For a sine pulse 
awy , r8/u 
F(s) = (1+. ) (1.04) 
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For a shifted cosine pulse 


2 
aw Qe #0) 


F(s) (1 


(1.0 


9 


2s(s? + w%) 


Suppose we let an impulse, and to take a specific example, a triangular 
impulse, operate on a mass m;. We have 


F = may (1.06 
in which 

F = F(t) = force in lbs. 

m, = mass in slugs 

ay) = #, = acceleration in ft/sec’ 


Let £[x,(t)] = Xi(s) = X; 
then 


L[m x1] = ms” Xi(s) 


the & transform of a triangular pulse F(é) is 


be 2 
LF (t)] = t : ) = F,(s) (1.07) 
b s 


be\2 
wt. ae J (: Whe. ) 
nb s? Ss 


The inverse transform is 


be\ 2 
£7(X4(s)] = a(t) = Tt] 2 a’ : ) (1.08) 
myb s? s 
bs\ 2 
n(t) = °. g J —s (1.09) |) 
mb s? 5 


The solution of 1.09 


3 3 3 a“ 
wih = af 72 u(t — b) + (i — u(t — 20) | (1.10) 


Substituting 





my b 6 6 


After the impulse is over, i.e., for values of ¢ > 2b, 1.10 becomes 


a(t) = & We — 6) (1.11) 
my 








ngular 


(1.06 


1.07) 


| .08) 


09) 


.10) 


11) 
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and the final velocity is 


b. (1.12) 


my 


which represents the area of the impulse divided by the mass. 
Similarly we find for the very short square impulse 


A, 
x1 = —l (1.13) 
my 
For the square impulse of finite time interval 4 
ab b 
1 = — (: — 5) (1.14) 
my, 2 


For the sine impulse 


) 
od T - 
x = — t‘- — (1.15) 
MW 2a 


For the shifted cosine impulse 


an Tv 
x= t-— 
NW WwW 


The velocity is the term preceding the term in parenthesis. 
In the five examples mentioned, we find that this velocity is proportional 
to the area of the impulse curve and inversely proportional to the mass. All 


(1.16) 


a . ~ p - 
~ and, since ais the maximum force present, 
Mm 


this expression represents the maximum acceleration and it is this value 
which is so frequently mentioned when discussing the actions on the shock 
table. 

For instance, from records we have determined approximate values for 
the time interval during which the energy transfer from hammer to the table 
takes place. The high-speed motion pictures are taken at the rate of 4,000 
to 5,000 frames per second, which means an average elapsed time of .22 
milliseconds or 220 microseconds. The energy transfer occurs within this 
time interval, because the rate of increase of the displacement from frame 

1 1] 
12000 second or 83 
microseconds. If the anvil moved within this time there would be evidence 
of blurring. Since we have been unable to detect any blurring, we may state 
that transfer is less than 220us yet more than 80us. 

Let us assume it to be 100us. That means a pulse width of 25 = 100us. 
(See Fig. 3.) 


expressions contain the factor 


to frame is constant. The exposure time of one frame is 
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a ° 
—~ = ado = acceleration, 
my, 
then 
ab 


= - = dpb 
my 


For a 2000-ft. pound shock the table speed v is approximately 7 ft. ‘se: 
Substituting, we find for ay or the acceleration 


7 = a X .00005 
or ad) = 140.000 ft./sec.” 
do = 4400 “‘g’’s 


This is about the order of magnitude which the accelerometers have 
recorded. 

The important conclusion we draw from this is that the acceleration and 
its time interval combine to produce a velocity of the base which is a com 
plete criterion of the severity of the shock administered. 

In the example just cited the weight of the table is approximately 800 Ibs , 
and the force 4400 x 800 = 3,250,000 Ibs. The result is, then, that a tri 
angular impulse of 3,520,000 Ibs. magnitude and a duration of 100us operat 
ing on a table of 800 Ibs., imparts to that table a velocity of 7 ft. ‘sec. 


PART II 
ANALYSIS OF THE RESPONSE 


In Part I the origin of the motion of the base has been treated. This mo- 
tion. of the base can now be represented by a pulse or a displacement as a 
function of time. To distinguish the displacement-time function from the 
force-time function, we have already suggested the name Whip. Obviously 
some of the pulse shapes which were used to represent impulses are not suit- 
able as whips. For instance, the square pulse as whip could not exist, since 
this would suppose an infinite velocity. 

The triangular whip is observed in the medium-high-impact shock ma- 
chine. The sine whip may be taken to represent approximately the output 
of the light-high-impact machine. 

The shifted cosine whip is sometimes used in the motion of cams of auto 
matic equipment. 

The problem of shock response is now reduced to the behavior of a mass 
and spring system when the base motion is represented by a whip. 
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Triangular whip (Fig. 3b). The Laplace transform of this pulse is 


7.) _ a laws ia, 
Fis) = 2 ( AY : 


lhe differential equation for a simple harmonic system is 


mé + kx = 0 (1) 
or 
x 
-+x = 0. (2) 
“a 


If we let the whip operate on this system, then 


~ +x = f(t) = (0) 3) 


in which x(t) represents the displacement of the whip as a function of time. 
Let L[x(t)] = X(s) 


and 
Llai(D] = LUf(O) = F(s) = Xils) 


then 
£{2) = sX(s) — sf(O) -- f'(0) 
By definition the initial conditions are zero, so that 


[#] = 2X(s) (4) 


The Laplace transform of equation (3) is then 


il X(s) + NX(s) = LUf(O) = Nils) 


re) 


or 


~ 


Ak 


(° + “) x(s) == X,(s). (5) 


Now 


Substituting and rearranging, 


, ad 1 —e “~ 
X(s) = - (6) 
S) 2+ wd ( 2 ) ) 
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This is the transform equation. To find x we use the inverse Laplace 
transform and the solytion of (7) is 


aw” t sin wl (¢—b sin w(t — b 
c= : I(4 _ ao) = 2( ee — ar ) u(t bas b) 


‘ = ee sin w(t — 0) At — 2. 
a w? 


The expression u(t — 6) simply means that the term to which it is attached 
is zero for all values of ¢ < 0d. 

Let us now consider what this solution consists of. 

There are apparently three terms which take effect at successive intervals 

The initial whip can be considered to consist of three different displace- 
ments starting at successive times 0, 6 and 2b. With the displacement of 
the base there is a corresponding displacement of the mass m. After the 
time 5 the second term or displacement takes hold and an associated dis- 
placement of mass m except that the initial conditions are the end conditions 
of the first displacement. After the time 2d the third displacement enters 
and the final result is the displacement-time pulse or whip. To make the 
problem somewhat simpler we introduce the following modifications: 

1°. Because the motion is a simple harmonic of known frequency after 
the whip has passed we will only consider the maximum amplitude. 

2°. Only the displacement-time function of the mass m during the pulse 
interval will be examined. 

3°. The dimensional magnitudes of the motion of mass m will be expressed 
as ratios of those of the pulse. 
If a is the maximum amplitude of the whip, and 7) = 26 its time interval 
(usually expressed in milliseconds), then we define 


il Amplitude ratio of pulse displacement and response of mass m dur- 


a ! ; 
ing pulse interval only. 


OP is ian = Natural frequency of mass m expressed as a 


Ya i 2n/w 1 ratio of the pulse length. 


(7) 


lt 


re Elapsed time expressed as a ratio of the pulse length. 
Xm x * . . > 
A= => Ratio of maximum amplitude to pulse displacement after pulse 
interval. 
Substituting these values in equation (7) and rearranging we obtain 
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This looks somewhat complicated, but we can simplify by omitting the last 
term, because we are only considering values of 6 during the pulse interval 


ll a ali! 
sate ~ TW 5 ((2 ~g- See D) (2s -~1) (9) 


Te re 


A plot of this equation for various values of ¢ is shown in Fig. 4. It is seen 
that 6 becomes a maximum when ¢ is approx. .9 and 7 is then .75. The dis- 
placement is approximately 1.5 times the peak displacement of the whip. 

After the whip has passed, or when 7 > 1, the transient has disappeared 
and a steady-state condition exists. Since the system under consideration 
is a simple harmonic system, the steady state is a harmonic motion of fre- 
quency w, with an amplitude to be obtained from equation (8). Indicating 
the dimensionless values of the amplitude by 6, when 7 > 1, equation 8 may 
be written 


fe at te sin 2grt _ 2 ((2r _ 4 — sin g(27 — =] 


fa) Te 
(10) 
sin 2rg(r — 1) 
+ (2. —i)- _ =m) a 
Trey 
After developing (10) and rearranging we obtain 
2(1 — cos mg) . 
ba = ( = sin wr¢(2r — 1). (11) 
Tre 
The maximum amplitude is 
— 2(1 — eos re) (12) 
ry 


A plot of equation (12) is shown in Fig. 5. Before considering the action of 
this whip in terms of what it does to the system, we shall take a brief look at 
the analysis of the two other whips; viz., the sine whip and shifted cosine 
whip (see Fig. 3). 


Sine Whip 


We have again equation (3). 
no8 +-2= fli) = »(b) 


and equation (5) 


2 2 
(c++) X(s) = F(s) 
w® 
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From 1.04 














A dW, , i 
F(s) = = etree  ): (13) 
Ss + Wa 
Substituting (13) in (5) we obtain 
X(s) da*w & 
> (Ss = 2 » 9 ” (1 + € 2 ). (14) 
(s + Ww Ys + wo) 

2 16 

ne | 

: w 1.2 + + + - + + + 4 

“aia | 

2 | 

a7 | 

ai- 

<)a 

-i> 08 + + + + + + + + 

vid | 

win 

—4 | 

ff 

e1? o.a te | ; | | 

$\% 

Paty | } 

ui «COG L L JL 

< Oo 0.5 1.0 1.5 2.0 2.5 3.0 3.5 40 45 5.0 
- PULSE LENGTH (INTERVAL) _ 4, 


NATURAL PERIOD OF SYSTEM 
Fig. 5—Maximum amplitude as a function of frequency ratio-steady state tri 
angular whip. 


The inverse transform gives us 


dw*wy 1 : 1 , 1 F 
t= 5 9 SIN wol — sin wl } + - SIN wo (f — 
WO — Wo Wh WwW Wo WwW 
| T Tv 7 
— snw(i — —)} u(t — —) (15) 
WwW Wy Wy 


and dividing this into two parts again, the transient and the steady state, 


we find for the transient, 


dW"Wy 1 . ‘ 
r= 9 2] — SIN aol — sin wl 
— ™ oe Wh Ww 


and substituting the dimensionless quantities 


A 


.¢ 


S = x TT / Wo 
? T 


and 


we find 


F : .. . , 
6 = sin tr — sin 2r¢gr }. (16) 
2¢ 











670 BELL SYSTEM TECHNICAL JOURNAL 


A plot of this equation as a family of curves for various values of ¢ is shown 
in Fig. 6. It is noted that, in general, this group of curves resembles those 
of Fig. 4 of the triangular whip. 
The steady state is 

dw* wo 


1 1 
fom J .| SIN wot — -sin wl 


qo — ee wo Ww 


—_ T is. cy 
+ — sinuwy (t — ~) — —sinw (t — ~) 

wo wo Ww Wo 
and, in dimensionless quantities or expressed as a ratio of the pulse dimen 


sions, we obtain 


dy 


6, = — Ay 


(03 we sin 2rye(r — 1). (18) 


From (18) it follows that the maximum amplitude of the steady state is 


4 co3 Te 


Ao ee. | 
1— 4 (19) 


A plot of this curve is shown in Fig. 7. 
Shifted Cosine Whip. 


The shifted cosine whip produces results of a similar nature. We have 
seen that the transform equation for this whip is (1.05) 


9 
dao 


F(s) = 
(s 2s(s? + wi 


— gion) (20) 


Using equations (3) and (5) and transferring to dimensionless quantities, in 
which 


x 27 / ww w t 
=5 Mayet, p= 
7 Wo Qr Wo 2r 


wo t 
we obtain 


6 = Ue — 1) (co: 2rer — ¢ cos 2nr | 
(21) 
— cos 2rg(r — 1)u(r — 1) + ¢ cos 2e(r — 1)u(r — »). 
Since we are interested only in the transient displacement, (21) becomes 


sictad (1 — cos 2mgr) — ¢ (1 — cos 2a) 


22) 
2(1 — ¢*) ( 


A family of curves showing 6 for various values of ¢ is shown in Fig. 8. 
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The steady state after the transient is (from Eq. 21) 


1 : 
i, = Ue me (co: 2rer — ¢ cos 2rr 


— cos 2rg(r — 1) + ¢ cos 2x(r — 1) ) 


which reduces to 









































‘ sin Tye 
6, = , sin (2rg7r — my). (23 
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Fig. 7—Maximum amplitude as a function of frequency ration-steady state sine whip 


The maximum amplitude is 


(24) 
a plot of which is shown in Fig. 9. 
Practical Considerations 


Let us now consider the action of these various whips in terms of what 
they do to the system. The designer of shockmounts is primarily interested 
in the displacement across the mount or the relative displacement of base 
and mass. 

In Fig. 10 the relative transient displacements for four systems are shown 
when subjected to a triangular whip. 


The natural frequencies are .4, 1.0, 
From this it appears that the 
maximum relative displacement is approximately equal to the maximum 


1.5, and 2 times the frequency of the whip. 
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whip displacement. It is also observed that the large relative displacement 
occur when the frequency of the system is smaller than the pulse frequency) 

After the transient has passed, the relative steady-state displacement 
which is of course equal to the absolute, obtains large values too. 

From Fig. 5 we note that a maximum of 1.5 is reached for the triangula: 
whip and up to 1.7 times for the sine whip (see Fig. 7) at a frequency oi! 
approximately } of the whip. Apparently even larger displacements 
across the mount occur after the transient has disappeared. 

This is illustrated in Fig. 11 for the same systems as in Fig. 10. 

As ¢ increases, which means if the frequency of the system increases with 
respect to the pulse frequency, the displacements across the mounts diminish, 
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Fig. 9-—-Maximum amplitude as a function of frequency ratio-steady state shifted 
cosine whip 


while on the other hand the acceleration increases as will be shown later (see 
equation 39). 

From this it seems advantageous to select a natural period of the system 
at least twice that of the pulse frequency. 

The relative displacements are limited by practical considerations, such 
as available space between cabinets and bulk head, cable connections, per- 
sonnel safety and others. 

In the design of Bell Telephone Laboratories radar equipment, the rela- 
tive displacement has been held to one-half inch, and the natural frequency 
in the neighborhood of 35 to 40 cycles per second or a period of 25 to 30 m.s. 

The average of the heaviest shock administered to this type of equipment 
has a peak amplitude of 1.5 inch and a time interval of approximately 60 m.s. 

From Fig. 5, we find that under these conditions a maximum relative dis- 





ian ALANNA APE ON 


pean asa bi emanate 8s 








TRANSIENTS IN MECHANICAL SYSTEMS 675 





1é€n' ; 0.8 > — [aneeinaeieiaen ti 
NC} . | - P 

1en| : 3 

C ouPRESS ON 
ula x 
y of —_ | 
ents 















































04 e— —+—__--- —+— —+ 
with 
/ u 0.6 H+ —- 4 
nish, 
1.2 — i i 
0.4 T T 
all Af. 2) COMPRESSION : 
t) : a 2 a 
| 
ae | | 
© 0.4 jp} ate - 
> 
= bes } 
< 
aw 0.8 -}——+—-—_ +--+ 
; « | 
‘ | 
: 1.2 L l 
0.8 oe 
oe i | 
5.0 : | @=1.5 
i 0.4 
ifted ; 
0 
| 
(see 
04+ 
em | 
: 0.8 i dies 1 } J 
uch : 0.4 a @ T — 
’ Qo 2 | 





acy 
n.S. ° 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 


ent ELAPSED TIME ca 


PULSE LENGTH NTERVAL 


la- 


| 
; i AECOMPRESSION | Mn 

j ) 7 

H 








1.8. 
lis Fig. 10—Transient time displacement curves across the mount for various values ol ¢ 


triangular whip. 











676 


1.6 


Fig. 11 


BELI 


SYSTEM TECHNICAL JOURNAL 










COMPRESSION 


=0.4 
} } a 









TENSION. 








} } 
@ =1.0 
— + + + 

4 | 

| 

| 

| 
} d 4 | i 

| 


COMPRESSION *:: 


+ + + oo | eee 





| 
} + 
y 
ENSION 

+ + 

} + + 

| 

| 











COMPRESSION 





COMPRESSION 



































Steady state time 


).3 0.4 0.5 0.6 0.7 0.8 0.9 1.0 
ELAPSED TIME 
SE LENGTH (INTERVAL) 


displacement curves across the mount. Triangular whip. 











TRANSIENTS IN MECHANICAL SYSTEMS 677 


placement of .42 times the peak pulse amplitude or approximately ~ inch 
may be expected. 

Taking into consideration that the shock mount has been designed with a 
certain amount of damping, it is thus possible to hold the relative displace- 


ment within the boundaries of its shock-absorbing capacity. 
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Fig. 12—System with damping 


Viscous Damping 


The fundamental differential equation for a system with damping is (see 
Fig. 12). 


été + 2+ wx = 0 (25) 
If we let a whip operate on this system we obtain 
¥+ 2% + wx = w 44(/) (26) 


However, the sudden displacement of the base also produces an acceleration 
of the mass proportional to the velocity. If x:(¢) is the displacement then 
4,(t) may be represented to be the velocity and 2%;(¢) the acceleration. We 
have, then, for the completed equation 


# + We + wx = w(t) + 264 (0) (27) 
In the Laplacian terminology, if 


xi(t)$="F(s) 
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then 
%(t) = sF(s) (initial value being zero) 
The Laplace transform of equation (27) is 


wo + 2¢s 


X(s) = ith 
Xs s? + 2fs + w? 


F(s). (28 


The solution of (28) is made easier if it is written in the form 


we + 2as 


MO Cae 


F(s) (29 
in which 
a=f¢ and a? + B? = w. 


Subjecting this system to a triangular whip, of which the Laplace trans- 


form is 
F(s) _ a p ro a 
j b $ 


‘ w+2as a ( —€ a 
X(s) = es a)? + BB : (30) 


the solution of which involves two transform pairs. The inverse transform 
gives us a solution of the transient as well as the steady state. It has been 
mentioned before that the steady state produces the maximum displace- 
ments across the mount; therefore it will be considered in more detail. We 
find that the steady state solution is 


we have 


at 
ae 


x(t) = ; (—sin Bt + 2” sin B(t — 6) — &™ sin B(t — 2)) (31) 
Which simplifies to 


at 
€ a 
B b 


Using dimensionless quantities 


x(t) = V/ (A? + B?) sin (8t — 6) (32) 


and St be 
a 


{ a 
w 
and the substitution 
Vines, 
we find that 
bB = buy = rey 











ar 


u 
+ 
< 
+ 
V 
> 
c 
< 
u 
+ 
‘ 





29 


ns- 


30) 
rm 
en 


Ce- 


Ve 


1) 


}2) 








TRANSIENTS IN MECHANICAL SYSTEMS 679 
Equation (32) may now be expressed as 


at 
ba = “— VW(A? + BY) sin (wyt — 0) at > 2qmy (33) 














we 
in which 
B 
tan é = — 
A 
and 
, 2 
A = —1-+ 2e”* cos myy — €™” cos 2ryy (34) 
P 2n ° = 
B = 2e”* sin rey — €™* sin 2z¢y (35) 
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Fig. 13—Effect of damping on steady state amplitude for triangular whip 


From equation (33) we obtain the maximum displacement 


at 
A = “— V/A? + B at > Inte (36) 
Te 


in which 
, B 
at = (tant? + tan 4) at > 2nre 
Y ” A 


In Fig. 13 a plot of equation (36) is shown for 7 = .5. This indicates that 
the peak value of A is .24 as compared to 1.48 when no damping is present. 


Accelerations 
The transient accelerations of the mass m during the whip action and the 


subsequent steady state may be found by examining the acceleration during 
the first part of a triangular whip. Designating the velocity of displacement 
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of the whip by v the expression for x may be formed from equation (7) 
setting ¢ < b. 


uw [t sin wt 
cu — ( - a2), t<b (3; 


6b \w? w3 
° a 
and, since - = 1 
b 
i 
x = vt —-— sin wt (38 
Ww 
whence 


% = vw sin wt 
and the maximum acceleration is 
Ao = WW (39) 


Let 


= No then No => l 
vw 
By proceeding in a similar manner with the next step of the whip the value 
of the acceleration ratio will be found to be 


do = 3 
and for the completed whip or steady state 
No = 4 


The expression 49 = Aotw is an important factor in shock considerations. 
Thus we havea simple relation for the final maximum amplitude of the periodic 
acceleration of the mass m when subjected to a triangular whip; viz., it is 
four times the product of whip velocity and natural frequency of the system. 
The constant Ao depends upon the configuration of the whip; the velocity 
v indicates the intensity of the whip; while w expresses the kind of response 
the system is capable of. 

It is of interest to note that this maximum periodic value of Ao will be pro- 
duced only if the ratio of pulse frequency and natural frequency is of the 
correct value. It is difficult to produce shocks on existing equipment of 
exactly the same characteristics within narrow limits as to time duration 
and therefore it must expected that a considerable variation in damage 
may occur even though similar shocks are administered to identical test 
objects. For the same reason a shock of lower intensity may produce 
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more damage than a hizher one because impulse amplitude as well as dura 
tion change at the same time. 

Although damping is a highly desirable feature in a shock mount, the 
damping device may cause a certain amount of coupling between the massand 
the base, and if too much damping is provided the transient acceleration of 
the mass may become excessive. 

The analysis of this problem by means of the Laplace transforms is not 
difficult, for we can use results previously obtained. The transform equa 
tion for a system with damping, subjected to a whip, is 


w? + 2as 


X(s) = - » F(s) (40) 
(sta)+ fs 

in which F(s) represents the transform of the disturbance or excitation. 

Since we are interested in the effect of the damping or y upon the response, 

only the first part of the triangular whip will be considered. 


In this case 


x(t) = wu 
and 
ixi(s)| = Fs) = —. 41) 
<2 
Substituting (41) in (40) 
” e(w* + 2as) 
X(s) = » = Fi(s) (42) 


sl(s + a) + BI 


If X(s) is the transform of x(/), a displacement, then the acceleration is ¥(/ 
or g(t), (¥(4) = g(t) by definition) and 


L{z(t)] = Lig(t)| = s*Xs 


Substitution in (42) gives 


oe 
§ ) 

2 2a ‘J 

¢s X(s) = 20a (43) 


(s+ a)? + B? 


Now £>"[s?X(s)] = g(d) 
so that 


). [~ 2 2 5 
g(t) = —— ( _ a) + 8 | « ® sin (8t + yp) $4) 
8 2a 
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Fig. 14—Transient acceleration during initial part of triangular whip. 
Letting 
g(t) a ¢ 
=i, a ee B=w V1 — 9 = wy 
tw (6) Ww 
and 
wl = A 
Substituting 
—9 
= sin (AO + w) (45) 
in which 
2ny 
tany = >— 7-3 
1 — 2n 
Figure 14 is a plot of \ against 4 for various values of n. It is noted that 
for » = .5 of critical damping the initial acceleration is equal to the un- 


damped, \ being one. 

The data presented here are also applicable to long duration pulses, be- 
cause the final results have been given in dimensionless quantities, the units 
of measurement being those of the pulse. 


SYMBOLS USED 


Mass a ee mM 
Mass of base....... _™ 
Spring stiffness...... is iitoeiests Sanaa 
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Displacement mass. x, x(t) 
Displacement base x1, X(t) 
Force on base... F, F(t) 
Velocity of base t 
Acceleration of base ~~’ 
Acceleration of mass m #, #(t), g(t) 
Maximum acceleration of the mass m Ag 
Maximum acceleration ratio he = 2 . 
Acceleration ratio = = 
Natural frequency of mass m (circular) w, Bo 
Circular (angular) displacement wt = A 
Frequency of sinusiode of which pulse consists (not pulse frequency) wo 
Peak pulse displacement a 
Pulse period (triangular) 2b 
™ (sine pulse) ~ ; 
Wo > To 
; 2 
(shifted cosine pulse) om 
Period of mass m.. z 
displacement during pulse period _ . a 
peak pulse amplitude a 
amplitude steady state _ ; 
peak pulse amplitude . 
max. amplitude steady state _ A 
peak pulse amplitude 
elapsed time = t t/m t/tew t | 
pulse length (interval) | rr ree, eal 
pulse length (interval) To _ 
natural period of system , 
Damping coefficient. fl, a 
Critical damping ratio -=7 
Transform of x(/) = X(s) 
. “m(l) = X,(s) 
“ “ F@) =... F,(s) 
“ «4% = F(s) 


f(t) represents any function of ¢, without reference to its dimensional 


magnitude. 











Maximally-flat Filters in Waveguide 


By W. W. MUMFORD 


Microwave radio relay repeaters require the use of band-pass filters which 
match closely the impedances of the interconnecting transmission lines and which 
suppress adjacent channels adequately. A type of structure called a Maximally 
Flat tilter meets these requirements. 

The ladder network which gives a maximally-flat insertion loss characteristic 
is discussed and several methods of achieving its counterpart in microwave trans 
mission lines are presented. Resonant cavities are used to simulate tuned circuits 
and the necessary formulas relative to this approximate equivalence are given. 

Experimental data confirm the theory and show that this technique vields 
remarkable impedance matches 


INTRODUCTION 

E USUALLY associated the word filter with any device which is sele« 

tive. The electric wave filter has that property which enables it to 
transmit energy in one band or bands of frequencies and to inhibit energy in 
other bands. Selectivity is the result of either selective absorption! 
or selective reflection. This paper discusses a special case of the classical] 
lossless transducer which derives its selective properties entirely from selec- 
tive reflection. The insertion loss of this type of filter can be analyzed in 
terms of the input reflection coefficient and the input standing wave ratio. 

In many applications of lossless filters it is desirable to obtain a character- 
istic such that the insertion loss, and hence the reflection coefficient, is smal! 
over as wide a band as possible. A special case described here, referred to 
as a maximally-flat filter, has a loss characteristic such that a maximum 
number of its derivatives are zero at midband. While the maximally-flat 
type of characteristic does not give the smallest possible reflections over a 
finite pass band, it does give small reflections, and has added advantages of 
simplicity in design and in many cases less transient distortion than filters 
giving smaller reflections. 

The desirable characteristics of maximally-flat filters have long been 
realized.':* Mr. W. R. Bennett* of these Laboratories derived the con- 
stants for a maximally-flat ladder network in the late 20’s, and gave simple 
expressions for the element values. Butterworth,’ Landon® and Wallman’ 
have treated maximally-flat filter-amplifiers in which the filter sections are 
separated by amplifier tubes. Darlington® has considered the general case 
of four terminal filters which have insertion loss characteristics that can be 
prescribed, but he places the emphasis more on filters that have tolerable 


* A list of selected references appears at the end of the paper. 
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ripples in the pass-band than on maximally-tlat structures. The work of 
Bennett will be followed closely not only because it came first, but also be 
cause it is easy to understand. 

Bennett expresses the values of the filter branches in terms of their cutoff 
frequencies, which in turn bear a relationship to the cutoff frequencies of the 
total filter. In the language of one who is familiar with microwave tec} 


nique,®: *. 


the values of the filter branches can be expressed in terms of 
the loaded Q's of the cavities, which in turn bear a relationship to the loaded 
( of the total filter. A simple mathematical expression connects the loaded 
0 to the cutoff wavelengths. 

At low frequencies the band-pass maximally-flat filter is made up of 


resonant branches connected alternately in series and in parallel. The 


microwave analogue of this contiguration is obtained by the use of shunt 


resonant cavities that are spaced approximately a quarter wavelength apart 
in the waveguide. Use is made of the impedance inverting property of a 
quarter wave line, thereby eliminating the necessity of using both series and 
parallel branches. 

The resonant cavity in the waveguide resembles a shunt resonant tuned 
circuit, but is different in several minor respects. An analysis of these 
differences reveals the corrective measures that are necessary in order that 
the simulation shall be sufficiently accurately attained. 

The first part of the paper deals with the concepts of loaded Q and reso 
nant filter branches of both the series and the parallel types. Admittance 
and impedance functions, as well as expressions for the insertion loss, are 
given using these terms, and the relationship between loaded Q and cutoff 
frequencies is stated. This con ept of loaded Q is then introduced to de 
scribe the performance of a complete maximally-flat filter in terms of its 
cutoff frequencies. The insertion loss is then given as a simple expression 
containing the total Q and the resonant frequency. The (Q's of all the 
branches are derived from the total Q in simple terms. The connection be- 
tween the insertion loss and the input standing wave ratio is then discussed 
before turning to the actual design problem. 

Next the paper deals with the application of the filter theory to wave 
guide technique. The limitations of the quarter-wave coupling lines are 
pointed out and the added selectivity due to them is derived. 

Then the paper compares microwave resonant cavities with parallel 
tuned circuits. Formulas are given which relate the geometrical configura 
tion to the loaded Q, the resonant frequency and the excess phase of the 
cavities. Three types of cavities are treated: those using inductive posts, 
inductive irises and capacitive irises. 

Finally, the measured results on a four-cavity maximally-flat filter in 1” X 


2” waveguide are presented and compared with the original design points. 
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teen connecting lines are given. 
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As a further confirmation of the theory, the experimental results on anot 
and longer waveguide filter consisting of fifteen resonant cavities and {c 
The conclusion is reached that maxima) 
at waveguide filters can be designed to have excellent impedance match 
and off-band suppression qualities. 


NOTATION 
Width of waveguide. 
Height of waveguide. 
Normalized susceptance. 
Velocity of light in free space. 
Capacitance in the r‘" branch of a filter. 
Width of iris opening. 
Diameter of post in waveguide. 
A small number < 1. 
Base of natural logarithms. 
Frequency. 
Resonant frequency. 
Frequency at half power point. 
Cutoff frequency of waveguide. 
Terminating conductance of filter. 
drt 
A 
Susceptance parameter. 
Length of transmission line. 
Length of line corresponding to excess phase of cavity. 
Length of line connecting two cavities. 
Resonant wavelength. 
Wavelength at half power point. 
Wavelength in transmission line. 
Wavelength in free space. 
Inductance in r™ branch of a filter. 
An integer, including zero. 
Number of branches in filter. 
Available power. 
Power delivered to load. 
Loaded Q. The selectivity of a loaded circuit. 
Loaded Q of the r™ branch. 
Loaded Q of the total filter. 
Terminating resistance of the filter. 
Distance from center of waveguide. 
Voltage standing wave ratio. 
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r Thickness of iris. 

Vnaz Maximum voltage on transmission line. 
V..in Minimum voltage on transmission line. 
w Angular frequency. 

1) Frequency parameter. 

} Admittance. 

¥) | Surge admittance of transmission line. 
Z Impedance. 

Zo Surge impedance of transmission line. 
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THE Y'S ARE USED TO DENOTE 
GENERALIZED ADMITTANCE FUNCTIONS 


Fig. 1—Block diagram of a filter consisting of a ladder network 
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Fig. 2—Schematic diagram of a low-pass filter 





GENERAL 

The art of designing filters which utilize lumped elements is well known. 
Desirable characteristics may be obtained by means of a ladder network of 
generalized admittances, such as is illustrated in Fig. 1. In particular a 
low-pass filter takes the configuration shown in Fig. 2, and a band-pass filter 
that of Fig. 3. In either case, certain frequency selectivity characteristics 
can be obtained when the individual branches are assigned definite values. 
The individual branches, 1;C; , L2C2, in the bandpass filter consist of an 
inductance, L, , and capacity, C, , in series or shunt. For the specific case 
to be discussed in this paper, namely a filter consisting of lossless elements 
intended for insertion between a source having an internal resistance R 
and a receiver having the same resistance, analysis is simplified if a branch 
is described in terms of its resonant frequency and its loaded V.* The 

* The loaded Q of a resonant branch in such a filter is the reciprocal of its percentage 


band width measured to the half power points when that branch alone is fed by the same 
generator and has the same load resistance as that of the total filter, 
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resonant frequency of a branch is independent of the terminal resistance 
is given by the relation 

1 
2rv/ L,C; 
The loaded ( of the branch, to be designated Q, , is a function not only of th 
inductance and capacitance in the branch, but also of the resistance, R, o/ 
the terminations on the filter. For the series resonant branches, it is giver 


by 


és 1  /L, _ wol, 
ai 2R V a 


and for the parallel resonant branches 











Fig. 4 Schematic diagram of a band-pass filter 


It may be noted that the loaded Q can be defined in terms of the insertion 
loss imposed by connecting the branch between a source and receiver each 
of resistance R. Analysis of such a circuit shows that 


P eff for¥ 
° 1+0;(>-— (4 
P, fo , 
where f is the frequency; 
Py is the power available from a generator which has an interna! 
resistance R; 
P, is the power delivered through the inserted branch to a load of 
resistance R. 
At the cutoff frequency, /. , detined as the frequency at which the power 


) 
delivered to the load is half the available power, = 2, whence 


P, 
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Written in terms of the wavelengths this becomes 


1 


No 
0) 


[his equation is a convenient one to use later in the discussion on resonant 
cavities. 

The normalized admittance of a single-shunt branch terminated by a re 
sistance R can be expressed in terms of its resonant frequency and its Q; 


VR=1 + 720, (4 -*). (7) 
lo / 


Similarly, the normalized impedance of a single-series branch terminated 


thus 


by a resistance R can be written 


i 1+ j20, (4 -#). 8) 
R ‘ lo / 


The use of the term loaded Q thus has the advantage that expressions for 
normalized admittance and normalized impedance of shunt and series reso- 
nant circuits respectively are identical, as are also the corresponding expres 
sions for their insertion loss functions. 

Loss functions of complete filters can likewise be expressed in terms of a 
loaded Q defined for the complete filter. For example, the loss function of 
the particular type of filter called a ““Maximally-flat” filter is givent 


] fy ie 
Po 0 } 
. . : ( 
p =- ] + f. f ) 
y f 


where # is the number of resonant branches in the filter, and f, is the cutoff 
frequency of the filter (half power points). 
In consequence of the concept of loaded O of the total filter, the loss 


function can be expressed as 


Po } te @ 
OC == (10 
P, ae or (F *) as 


where the total Q, of the filter is 
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For convenience, the bracketed term of equation 9 may be called &, 4 
frequency parameter, whence 







I 
lo 
f. 
fo fe 













and the loss function becomes 


; ; 
P, 








IN DECIBELS 








INSERTION LOSS 
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Fig. 4+--Insertion loss of maximally-flat filters. 


MAXIMALLY-FLAT FILTERS 

The loss function for maximally-flat filters as given in equation 13 is 
plotted in Fig. 4 where the insertion loss in db is used on the ordinate and 
2" is used on the abscissa. 

The ladder network which gives rise to this loss function consists of » 
resonant branches, as shown in Fig. 3, that are all tuned to the same fre- 
quency, but whose selectivities, or loaded Q’s, are tapered from one end of 
the filter to the other according to the positive imaginary parts of the 2n 
roots of — 1, according to the theories of Bennett and Darlington.* These 


“i @ 4 
2n “ 


roots are expressed thus 
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dQ. ‘ , : . : 
where r is the number of the root, 7 is the total number of branches. Thus 
the selectivities of the branches follow the relation 
; 2r — 1 
0), QO, sin Ls 14 
(1 2n 
where Q; represents the selectivity of the total filter, and Q, represents the 
° * - th “j “exe a nl ‘: 
required selectivity of the r ~ branch, e.g., the selectivities of the first, second 
and third branches are 
(13 
QV; = Ursin 
n 
. 32 = 
Q. = Ursin (15) 
an 
. sr 
Q3 = Ursin : 
Qn 


This type of filter is particularly practical when a filter is required to give 
more than a certain amount of insertion loss in an adjacent band, and less 
than another certain amount of insertion loss at the edges of the pass-band. 
Putting this information in equation 10 gives two equations containing two 





unknowns, Q,, the selectivity of the total filter, and , the number of 


t branches needed to fulfill the stated requirements. The solution for x 
| may be fractional, in which event the next higher integral value of » is 
chosen, and this value is used to determine the selectivity, Vr , of the filter. 
r From this, the selectivities of all the branches are determined in accordance 
with equation 14. 
104 
STANDING WAVE RATIO 
An alternative way of specifying filter performance is to refer to the input 
impedance mismatch as a function of frequency. The impedance mis- 
3 is match can be expressed in terms of the direct and the reflected waves and in 
and terms of the standing wave ratio that exists along the transmission line that 
| connects the properly terminated filter with its generator. The standing 
fn ; wave ratio and the insertion loss of a filter bear a definite relationship to 
fre- each other if the filter is composed of purely reactive elements. This rela- 
| of tionship is given by the formula 
2n Pe (3+ 1)° a : 
ese : P, — 4S ,} 


, ; ~ Wwe > ; 
where S is the standing wave ratio, ,,""", of the maximum voltage to the 


minimum voltage as measured along the transmission line. 
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When the filter characteristic is given by equation 13, the relationship | 
tween 2, the frequency parameter, and the standing wave ratio can be « 
pressed as 
(Q)" = (1 
This is shown graphically in Fig. 5, where the standing wave ratio is give 


in db (20 logo y . This graph is used as an aid in the design of filters o! 


min 








a) 

(o) 
+ 
+ 
+ 





IN DECIBELS 





INPUT STANDING WAVE 
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Fig. 5—Input standing wave ratio of maximally-flat filters. 


this type, where the requirements are given in terms of the standing wave 
ratio. From this information the number of filter branches and the selec- 
tivity of the total filter can be determined, either from equation 17 or from 


Fig. 5. 


DISTRIBUTED BRANCHES 


It has been assumed that the mutual impedances of successive branches 
are all zero. At low frequencies this limitation may not be a serious one 
and the practical realization of the expected filter characteristics is accom- 
plished by shielding properly one branch from another. However, as the 
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frequency is increased it becomes difficult to isolate the branches and unde 
sirable mutual impedances arise which complicate the problem. In par- 
ticular, in the microwave region, where waveguides are used, the physical 
size of each branch may be large compared with the wavelength and it is 
then impossible to lump all the branches at one place in the waveguide with- 
out encountering the complicated effect of mutual impedances. 

A practical way of circumventing this difficulty is to distribute the branch 
circuits along the transmission line or waveguide at such distances that the 
mutual impedances become negligible. Then, however, the lengths of 
transmission line act as transducers, but since their properties are well under- 
stood and readily calculable this appears to be a practical solution. As a 
matter of fact, the impedance transforming properties of a length of trans- 
mission line can be used to advantage.’:'’:' For instance, it is well 
known that a quarter wavelength of lossless line transforms a load imped- 
ance according to the relation 

Zo 


™ (18 
Z, 


F 4 


where Zo is the surge impedance of the line and Z, is the load impedance 
Hence if the load impedance consists of a series resonant circuit contain 

ing an inductance, a capacity and a resistance equal to Z» in series, the im 

pedance at the input end of the quarter wavelength of line is given 


Zo 


Z = f 
1+ po ~ 4 


The input admittance is 


y= ¥o/1+ 20 (4 -/2) (20) 
6 lo f 


As can be seen from equation 7, this is identical with the input admittance 


(19) 


of a parallel tuned circuit whose terminating conductance is 
G = Yo. (21) 


The quarter-wave line likewise transforms a parallel circuit to a series 
circuit, as is illustrated in Fig. 6. This property of the quarter-wave line 
thus makes it possible to simulate a ladder network of alternate series and 
shunt branches by spacing shunt branches (or series branches) at quarter 
wavelength intervals along a transmission line, as illustrated in Fig. 7. 


The resonant frequencies and the selectivities of the branches are chosen as 
before. 
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Sometimes in practice a quarter wavelength may not be sufficient spaci 
to avoid mutual impedances arising between adjacent elements, in whi 
event the connecting line may be increased to a higher odd multiple of quar- 
ter wavelength. This accentuates the frequency sensitivity of the conne 
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YL = Yo (1+ j2n) 2’= 292 ¥9 (1+ j2n) 


Fig. 6—Illustrating the impedance inverting property of a quarter wavelength of trans 
mission line. 


LUMPED CONSTANT FILTER USING SERIES & SHUNT ELEMENTS 
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LUMPED CONSTANT FILTER USING ONLY SHUNT ELEMENTS 
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Fig. 7—-Simulation of ladder network by shunt branches at quarter wave intervals. 
ing line, but this effect can be taken into account by decreasing the selectiv- 
ities of the branches themselves by appropriate amounts. In narrow-band 
filters this may be negligible, but in broad-band filters it may be consider- 
able, as shown in the following analysis. 


SELECTIVITY OF CONNECTING LINES 


Consider a length of transmission line having a surge impedance Zp = 


y and terminated in a parallel resonant circuit containing an inductance, a 
0 
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capacitance and a resistance equal to Z) ,* as in Fig. 6. The terminating 
admittance is given by the relation (See Eq. 7 and 9) 






Vi, = Vo(1 + 722). (22) 





The input admittance at the end of the length of line, f, (nominally a quar- 





ter wavelength long) is given by the relation 









r (1 + 722) cos @ + 7 sin 0 ais 
= : 3 22) 
Yo cos 8 + j(1 + 72Q) sin @ 
where 
Jarl 
4 = 
Xr 
f = length of line 
\ = wavelength 
q= {2 - fe) 
ean 1 f 
letting 
w 2 us TO 
‘= ( 5) = + (24 
) 1+ 6 ) + ) ) 
5 RO. 76 - 
cos 9 = —sin = — (25) 
2 2 
, 76 
sin @ = cos = = 1 (26) 


where 6 is a number small compared with 1. , Then the admittance becomes 


This is the normalized input admittance of a circuit as shown in Fig. 8, 
where each end of an ideally inverting line is shunted by a tuned circuit 
75 


whose normalized admittance is j , - 


et IH SiN i 


: 7 . 2rl ee 
From Eq. 24, setting — 7 it follows that 





1). (28) 


| ; P . 7 - . 
' * More generally, the terminating admittance can assume any value without affecting 
: the result. 

} 

| 
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} 
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From equation 7, the admittance of the circuit is expressed in terms of ‘ts 


. 76 ; f fo 
= 720 ——). 2 
ae i. (f ’) | 


Solving for the selectivity of this circuit, from Equations 28 and 29: 


selectivity, thus 


T 
QO = F ( sf 
8 
The selectivity of the coupling line can hence be counteracted by sul 
tracting = from the selectivities of the branches associated with it, provided 


re 
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Fig. 8 Schematic diagram illustrating that the selectivity of a quarter wavelengt! 
of line can be represented by adding a tuned circuit to each end of an ideally inverting 
impedance transformer 








the coupling line is a quarter wavelength long. If it becomes necessary to 

. aS oe ; al a 3m 
use } wavelength coupling lines, the selectivity of the line is tripled and = 
is subtracted from the selectivities of the associated branches. 


RESONANT CAVITIES 

The foregoing analysis reviews the principles of the design of filters which 
use lumped-constant circuits distributed along a transmission line. These 
principles can be applied to the design of filters in waveguides, coaxial lines, 
or any other types of transmission lines, provided that these lines are sufh- 
ciently lossless, the band is sufficiently narrow and the branches themselves 
are realizable. In the microwave region the first two provisions are usually 
met without difficulty, as is also the third provision when circuits with dis- 
tributed constants are used. It may be difficult to construct a coil and a 
condenser circuit for microwaves, but easy to construct a resonant cavity 
which displays some of the desirable properties of the tuned circuit. Reso- 
nant cavities are similar to lumped tuned circuits in two respects.” "= They 
transmit a band of frequencies and they introduce a phase shift. An ap- 
proximate equivalence is demonstrated in Appendix I, and is illustrated in 
Fig. 9, which depicts a resonant cavity as being nearly identical with a 
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tuned circuit situated across the middle of a short length of transmission 
ine. This short length of transmission line is added in order to account for 
in excess of phase shift associated with the resonant cavity, but it can read- 
ily be absorbed in the connecting line which otherwise would have been an 
odd quarter wavelength long. 

The similarity between resonant cavities and resonant lumped circuits 
enables one to use the known art of designing lumped element filters to de- 
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RESONANT -. v ( 
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Fig. 9A resonant cavity is approximately equivalent to a resonant circuit shunted 
across a short length of transmission line 
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Fig. 10—The resonant wavelength and the loaded Q of a cavity depend upon the 
normalized susceptance of the end obstacles and their separation. 


sign filters which use resonant cavities, provided that the selectivity, the 
resonant frequency and the excess phase shift of the resonant cavity are 
known. 


RESONANT WAVELENGTH AND LOADED (Q OF CAVITIES 


These properties can best be derived by considering one of the usual types 
of cavities, which consists of two obstacles or discontinuities separated by a 
length of transmission line. Such a cavity is shown schematically in Fig. 
10. The obstacles at each end are assumed to be equal, and to have an 
unvarying susceptance BY), where Yo is the surge admittance of the con- 
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necting transmission line. This type of cavity is resonant when the rel:- 
tion is satisfied’: }* 

arf 
Xo B 


bb 


tan 


where Ao is the resonant wavelength in the transmission line, 
fis the length of the cavity 
B is the normalized susceptance of the end obstacles. 


This resonance occurs at any number of wavelengths, but the 1st or 2nd 
longest wavelength at which resonance occurs is in the region which is usu- 


ally of greatest interest. 
The selectivity in this region is determined also by the value of the nor- 


malized susceptance, B, of the obstacles, and is given by the relation (See 


Appendix I) 


? 
arc tan — 
B 


_ 
~ 
I 
. 
-_- 
w 
~ 


2 arc sin —>>—= 

© BE + 4B? 
This selectivity is based upon the wavelength, not the frequency parameter. 
In terms of the wavelength in the transmission line this is 


arf 
r . Ago 
Q = | —f— | = | _* (33) 
Qrf arf get ait Agea 
Ngel Age? 
where A,o is the wavelength of resonance in the transmission line and \,, 
is the wavelength at the half power points. If the phase velocity in the 
transmission line does not vary with frequency, then the selectivity can be 
expressed simply in terms of either the wavelength or the frequency since 
f fo do nN ‘ 
Ee a ee, (34) 
to I r do 
However, when the velocity in the transmission line varies with frequency, 
equation 34 does not hold true, and the expression relating the two parame- 
ters is more complicated. In the case of the rectangular waveguide 


Cc 


=—> > (35) 
Vf? 4 Jew 


g 


where c is the velocity of light in vacuum, f,.. is the cutoff frequency of the 


. 
waveguide, fur = 2a and a is the width of the waveguide. 





(31) 
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it can be shown readily that the frequency parameter can be expressed 
in terms of the wavelength, thus 


( f 4 = me) (“) (=) ; 
sas = i — (360) 
to | Ny Ago Ng Ago 


where Aq is the wavelength in free space. 
For narrow percentage bands, this reduces to the approximate relation 


( f ft) ( 4 - " - 
oe ane — ‘ \5/ 
Jo | Ng Ago Ago 








































































“a9 rT T T no 
PoP ER 
| | | | | 
o aw. | 
-10 . —- —>+ . 
7 Ses eee | a aes! ; a eee al 2 Sl SS 
. { | ! | [| t : { 
= | A - 
-6 oe TH | eos =| Se: S| (eres 
3 Ss im a = : . 2 a 
w 4 CAVITY WITH INDUCTIVE OBSTACLES 
y -4/-———_+— + -- p- 
 , eZ _ = 3 
- ee ee. 5 : 
a A [i8 | 3 
Vv ~ -< 
a . £2335 
i eens aa” me mae 1 ‘en 
a | me B + VB4+4B] 5,-1.2 
Pa ~ ae ie 4 B 
4 2 SIN? ——-S$— 
| VB4++4B2 
=| rn | 4 rn n r a ee 4 
1 2 3 456 8 10 20 30 40 5060 80 100 200 300 
SELECTIVITY (LOADED Q) 
Fig. 11—The relation between loaded Q and normalized susceptance. (Inductive ob 
stacles) 


This states, in effect, that the percentage bandwidth is greater in terms of 
wavelength than in terms of frequency, by the square of the ratio of the 
wavelengths in the guide and in free space. The selectivity in terms of the 
frequencies and wavelength ratio thus becomes 


1 dao fy ao ‘ 
Qs 7 ae => — - (38) 
f. £) Ago (fee — fer) his 
fo fe 
This is the selectivity that is plotted as a function of B in Figures 11 and 12. 
EXxcEss PHASE AND CONNECTING LINES 
The excess phase of this type of cavity is taken into account by adding the 
lengths of line, ¢’ (see Fig. 9), which have a length given by the relation (See 


Appendix I) 


’ ) 
tan def =— (K) ‘ (39) 
ij B 
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Combining Eq..31 and Eq. 39 and solving for ¢’ in terms of ¢ 
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where fis the length of the cavity and Ayo is the resonant wavelength in t 
line. 
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Fig. 12 The relation between loaded Q and normalized susceptance. (Capacitive ol 
stacles) 

Thus, when this length, corresponding to the excess phase of the cavity 
resonator, is absorbed in the length of line connecting two cavities together, 
the correct total connecting length becomes 


(. = (2m + 1) _ ~f~ 
ate », Ay 


where f, and 2 are the lengths of the cavities and m is any integer including 
zero. 


OBSTACLES IN WAVEGUIDES ? 


The three properties of the cavity—the resonant frequency, the selectiv- 
ity and the excess phase—are given in Equations 31, 32 and 39, regardless 
of the sign of the normalized susceptance, B. In the case where the obstacles 
are inductive, B is negative; and where the obstacles are capacitive, B is 
positive. 
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Further explanation is needed to distinguish between these two impor 
tant cases. First consider the case where inductive obstacles are used. 


fan = is negative and the cavity length lies between a quarter and a half 
Ago 

wavelength (plus any multiple of half wavelength). The selectivity, as 
given by equation (32), is plotted on Fig. 11 for the fundamental mode 
lhe excess phase is positive, and the added lengths, ¢’, of Fig. 9 are positive 
Che connecting lines between two such cavities are then slightly less than a 
quarter wavelength (or odd multiple thereof). 

Next consider the case where the obstacles are capacitive. Tan og is 

—_ 
positive and the cavity length lies between zero and a quarter wavelength 
plus any multiple of half wavelengths). The selectivity as given by equa- 
tion (32) is plotted in Fig. 12 for cavity lengths lying between a half wave- 
length and three quarters wavelength. The excess phase is negative and 
the added lengths, ¢; of Fig. 9 are negative. The connecting lines between 
two such cavities are then slightly longer than a quarter wavelength (or odd 
multiple thereof). 
SUSCEPTANCE OF OBSTACLES 

The Equations (31), (32) and (39) give the resonant wavelength, the selec- 
tivity (in terms of wavelength) and the excess phase as functions of the nor- 
malized susceptance of the obstacles which form the ends of the cavity, and 
a knowledge of this susceptance as a function of the geometrical configura- 
tion of the obstacle is necessary to complete the design of the filter. At low 
frequencies, conventional coils and condensers can be used to form the dis- 
continuities in the transmission line; while at high frequencies, transmission 
line stubs can be used.“ In the microwave region, where waveguides are 
employed, obstacles having the shapes shown in Figures 13, 14, and 15 can 
be used.” 

INDUCTIVE VANES 

Figure 13 shows a plane metallic obstacle, transversely located acyoss a 
rectangular waveguide, with a centrally located rectangular opening extend- 
ing completely across the waveguide in a direction parallel to the electric 
vector. For thin obstacles, the normalized susceptance can be calculated 


from the approximate formula,’® 


a ) wi 
B+ — —cot — (42) 
ad 2a 


where A, is the wavelength in the waveguide, a is the width of the waveguide, 
and d is the width of the iris opening. 
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When the iris is constructed of material of finite thickness, 7, the expressio 
for the susceptance is more complicated,"’:" and the equivalent circu: 
becomes a four-terminal network with both shunt and series element: 
The equivalent shunt susceptance of this network can be obtained experi 
mentally by measuring the insertion loss of the iris, from which a curve suc! 
as shown in Fig. 16 can be computed. These data* were taken for irises 
.050” thick in waveguide having internal dimensions of 0.872” & 1.872” i: 
the frequency range around 4000 mc. The ordinate is a parameter, K, from 
which the normalized susceptance is calculated: 


B=K (3) . (423 
2a 
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Fig. 13—One type of inductive obstacle in rectangular waveguide. 
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Fig. 15— Another type of inductive obstacle consists of a cylindrical post. 


Along the abscissa is plotted the ratio of iris opening to width of the wave- 
guide. 

It can be demonstrated that for values of K from —1 to — 20, the equiv- 
alent iris opening is approximately the actual opening less the thickness of 
the metal sheet. For practical purposes, when the susceptance lies be- 
tween — 1.5 and — 30, it is often sufficient to use the approximation, 


‘ ,—- 3 
B= — Xo cot" = “4 (44) 
ad 


2a 
where 7 is the thickness of the iris. 


* Data supplied by Mr. L. C. Tillotson of Bell Telephone Laboratories. 
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CAPACITIVE IRISES 
The normalized susceptance of infinitely thin capacitive obstacles, as illus- 
trated in Fig. 14, may be calculated by the approximate relation” 
+b wd 


ee log, cosec (45) 


Ne 26 
where 6 is the height of the waveguide, A, is the wavelength in the waveguide, 
and d is the width of the iris opening. 
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Fig. 16—Experimentally determined curve of normalized susceptance of inductive irises 


As with the inductive vaneg, the normalized susceptance is a function of 
the iris thickness and may be calculated from the approximate formula” 


dar [6b d 
B = By —_— (46) 
— (; ) 


g 


where By is the normalized susceptance of the infinitely thin iris, and 7 is 
the iris thickness. 

For best results, the irises should be designed from experimentally deter- 
mined curves, however. 
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INDUCTIVE Posts 

The normalized susceptance of the round cylindrical inductive post, ce: 

trally located in the waveguide parallel to the electric vector, may be calcu 
lated from the approximate formula! '5. 16 
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Fig. 17;-Experimentally determined curve of normalized susceptance of inductive posts 


The experimentally determined values of susceptance are somewhat less 
than the values calculated by the formula (47). The difference is less than 

" d. ree . 4 
20% when ~ is less than 0.08. A curve of experimentally determined values 

€ ! » 

¢ 

is plotted in Fig. 17, the data being taken in rectangular waveguide 0.872” < 
1.872” at a frequency near 4000 mc.* 

The normalized susceptance of posts is also a function of their position in 
the waveguide, the susceptance decreasing as the posts are moved off center. 
This feature may be used when it is desired to make all the posts in a filter 


* Data supplied by Mr. A. E. Bowen of Bell Telephone Laboratories. 
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from stock of a given diameter. The expression for the normalized sus- 
ceptance of off-center posts is given by the relation" 


2X 1 
B= —-— 1s 


a 2 7S da ws\ } 
sec — log. — (5 — 
a 7 de as ij 


where s is the distance off center. 


EXPERIMENTAL DATA 

The principles of waveguide filter design as outlined in the foregoing have 
been used in several applications. For example the channel branching fil 
ters in the New York-Boston microwave radio relay link consist of two reso 
nant cavities separated by the equivalent of { wavelength sections of 
waveguide. The transmitting modulators in this relay system also use 
two-chamber filters to separate the wanted sideband from the unwanted 
sideband. The transmission band in each of these applications was 10 m« 


- 
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Fig. 18—Diagram of a transmission line filter consisting of four resonant cavities and 
three connecting lengths of line. 
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and the image frequency or the unwanted sideband which was to be re 
flected was 130 mc away. 

In another case the requirements were that the standing wave ratio should 
be less than 0.64 db over a band of 20 mc and more than 28 db 30 me on 
each side of the midband frequency. The design formulae indicated that a 
filter consisting of four cavities would be needed. These, then, would take 
the general configuration shown in Fig. 18, where the first and last cavities 
are formed by the obstacles 7B; and the length of line ( , while the two mid 
dle cavities are formed by the obstacles 7B; and the length &. The lengths 
f2 and f2. correspond to the transforming sections of transmission line which 
connect the cavities tegether. The loaded Q’s required to meet the speci- 
fications turned out to be Q; = 12.25 and Q2 = 30.0, after allowance had 
been made for the selectivities of the ? wavelength connecting sections. 
Assuming that the cavities would be formed with inductive obstacles, as 
shown schematically in Fig. 19, the susceptances to obtain these selec 
tivities were obtained from Fig. 11 based on equation 32. This gave 


B, = 4.08 


By = — 6.36 
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These susceptances were realized with centrally located round posts, for 
which the data of Fig. 17 has been plotted, and this filter was constructed 
according to the calculated dimensions which are shown in Fig. 20. Each 
of the four cavities was tuned separately to resonance near midband by ad- 
justing a capacitive plug located in the center of each. The characteristic 
then obtained is plotted in Fig. 21, which shows that the standing wave 
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Fig. 19—A four-cavity filter which utilizes inductive obstacles. 
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Fig. 20-—The calculated dimensions for a four-cavity maximally-flat filter in 0.872” x 
1.872” rectangular waveguide. 
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Fig. 21—-Measured characteristic of four-cavity filter of Figure 20. 


ratio met the design points quite well. These are shown as circles in the 
figure. The insertion loss of this filter was less than 0.7 db over a 25-mce 
band and less than 0.3-db at midband. 

Another maximally-flat waveguide filter consisting of fifteen resonant 
cavities gave an insertion loss of two decibels at midband, 4-db loss at 20- 
me bandwidth and 40-db loss at 30-mc bandwidth. The input standing 
wave ratio was less than 1.0 db over a 20-mc band. Its characteristics are 
plotted in Figs. 22 and 23. This excellent performance is remarkable in 
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Fig. 22—Measured standing wave ratio of maximally-flat filter consisting of fifteen 
resonant cavities. 





60 — 1 


55 


nN Ww Ww a 
Wn °o Wn °o 


INSERTION LOSS IN DB 
nN 
°o 











O20 4630 ; 4040 4050 4060 


FREQUENCY IN MC 





Fig. 23-—— Measured insertion loss of the fifteen-cavity filter 


view of the difficulties that might be encountered in constructing and align 
ing a filter consisting of 75 discontinuities and 29 lengths of waveguide. 
Its physical length (over 80”) may be seen in the photograph of Fig. 24 
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The theoretical treatment of maximally-flat filters presented here has ig- 
nored the dissipation in the elements. Better agreement between expected 
and observed characteristics would be obtained if this had been taken into 
account. The observation of .3-db loss and 2-db loss in the four-cavity and 
the fifteen-cavity maximally-flat filters is indicative of the amounts of added 
insertion loss to be expected because of dissipation in the elements. In 
addition to the increased loss at midband, we should expect a rounding of the 
insertion loss characteristic near the cutoff frequencies, and a broadening of 
the standing wave characteristic at frequencies well beyond cutoff. In 
many applications, however, these effects can be ignored. 


CONCLUDING REMARKS 


In the foregoing, the design of maximally-flat band-pass filters has been 
treated in detail. The treatment of other types of band-pass and band- 
rejection filters is beyond the scope of the present paper, although much of 
the material presented here may be of use in designing such filters. In 
fact, almost any filter consisting of a ladder network «f inductive and capaci- 
tive elements in series and in shunt can be simulated in waveguides by fol- 
lowing these principles. Emphasis on the maximally-flat filter has been 
deliberate for two reasons: it gives a type of transmission characteristic that 
is useful in microwave work; it is simple to design. 
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APPENDIX I 

A cavity resonator, consisting of a length of transmission line, f, at each 
end of which there is an unvarying susceptance, 7B, is approximately equiv- 
alent to a tuned circuit, consisting of an inductance, 1, and a capacity, C, 
in parallel located at the center of a short length of transmission line, 2¢’, 
when these two conditions are satisfied: 

(1) The square root of L over C is equal to the surge impedance of the 
transmission line divided by twice the loaded Q of the cavity. 

(2) The sum of the lengths of the two transmission lines f and 2¢’ is equal 
to a half wavelength at resonance. 

The first of these conditions follows from equation 3 of the text above, 
and the proof of the second condition will be given in the following analysis, 
based on the schematic drawing of Figures 9 and 10. In this analysis, the 
loaded () of the cavity is derived in terms of the susceptance of the obstacles 
at its ends. 

Since the cavity and the tuned circuit are both symmetrical it is adequate 
to consider but one half of each in establishing the equivalence. Then by 
setting the short circuit admittance of one equal to the other and setting the 
open circuit admittance of one equal to the other, the necessary relationships 
are derived. 

The following symbols will be used in addition to those used in the text: 

Vse = Normalized admittance, short circuited. 


Voc = Normalized admittance, open circuited. 


The subscripts 1 and x refer to the cavity and the equivalent tuned circuit 


respectively. 
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The short-circuited admittances of half the cavity and half the tuned cir- 
cuit are 


V sci _ j(By — cot 0;) (A1) 
V sce = —4 cot 6, (A2) 


while the open-circuited admittances are 


Yoo. = j(B; + tan 4) A3) 
.[B 
i( = + tan 0) 
V ocz = B ° A4) 
Ir 
1 — = tan 6, 
Putting Vsci = V sco 
1 a 
tan 6, = (AS) 


cot 6; — B, 
Putting AS in A4 and setting Vou = Voc: 


Bs 1 
: ,—-B 
B, + tan 6; = 2 tdi Bi j (A6) 
Be 
2 cot A; = B, 


Solving for B, we have 


B, == B,(B, sin 26, — 2 cos 26) A7) 
which becomes 
; ~ . 2wrl fro 
b, V Bi + 4857 sin —-—1 \8) 
Ago Ng 
where 
arf ' 2 ‘ 
= arc tan — (AQ) 
Ago 1 


Equation A9 gives the requirements for resonance. 
The expression for the loaded Q is 


2rf 


d 
| a (A10) 
2rl = 2xl 


Neue, Noa 
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The cutoff wavelengths are obtained when B, is equal to + 2 and we ha 


from equation A8 


Noa 


2rft 


r 


ge? 


from which we obtain 


arc 


Qos 
2 arc sin 


daft 


2rf . . 
x — arc sin B,/B? + 4’ iis 
2r€ . : 
. + arc sin Biv/B? + 4” (Al2 
2 
tan B i. Bi + 4Bi oe — 2 (A13) 
2 . se 


V BY + 4B 


This gives the loaded Q of the cavity in terms of the susceptance of the end 


obstacles. 


To derive the length corresponding to the excess phase of the cavity, let 
the short-circuited admittances be equal by equating equations Al and 
A2, and let the wavelength be the resonant wavelength of the cavity, and 


we have 
B, — cot 6 = — cot 6,. (A14) 
From equation A9 
B, = 2 cot 261 (A15) 
so that 
2 cot 2610 — cot 00 = —cot 64. (A106) 
But 
2 cot 26:0 — cot O09 = —tan yp (A17) 
hence 
tan O10 = cot 04 (A18) 
or 
610 + O00 = - (A19) 
That is 
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whence 


(+ 2¢’ = A420) 
which proves the second condition mentioned above, namely, that the sum 
if the lengths of the transmission lines in the cavity and its equivalent cir 
uit is equal to a half wavelength. 

The normalized admittance of the cavity terminated in the surge admit 


tance of the guide can be written in terms of its loaded Q and a wavelength 


us q 
VR+1+4 20| 2 (*) wi 1}. 421) 


This expression is obtained from equations A8 and Al3 by making the as 


variable as 


sumption that the bandwidth is narrow so that the sine of the angle in equa 
tion A8 can be replaced by the angle. This admittance is referred to a point 
slightly inside the cavity, i.e. a distance ¢’ inside. 

The similarity between this expression and the corresponding one for the 
parallel resonant circuit consisting of lumped elements is evident. (See eq 


7 of the text.) 
. f fo 
YR = 1+ 20 : \22) 
In the case of the cavity the bracketed term is a wavelength variable; in 
the case of the tuned circuit it is a frequency variable. 
The loss function for maximally-flat filters in waveguides becomes 


Po Ayo \ f" 
=_i1+1]0,2 ~ — | . 23) 
P, [ (A } A y 


g 


The loaded Q’s of the cavities taper sinusoidally from one end of the filter to 


. far— i 
Or sin ( : ) T. \24) 
2n 


the other so that 











Transient Response of an FM Receiver 
By MANVEL K. ZINN 
INTRODUCTION 


HIS paper develops various formulas for the response of an FM 

receiver to signal or noise input voltages of arbitrary form. The prin 
cipal object in view is to obtain a more complete understanding of how a1 
I'M receiver responds to, transient voltages, such as those arising from 
ignition interference, but the more general aspects of the theory have other 
applications as well. In particular, general formulas are given for the re 
sponse of a linear circuit to an applied voltage, or current, of variable fre 
quency. The Fourier transforms, or frequency spectra, of the response, and 
the envelope thereof, are determined. 

Two examples are given: (1) the audio response of an FM receiver to a 
very large impulse and (2) the response, including harmonic distortion, to 
a sinusoidal siznal wave. 

The element of an FM receiver that demands most discussion is the bal- 
anced frequency detector. The greater part of the paper accordingly deals 
with that important element. The general problem can be stated as follows: 
A limiter and frequency detector are transmitting a steady unmodulated 
carrier wave to an audio output circuit. At time, ¢ = 0, frequency modula- 
tion of arbitrary form is applied to the carrier (either by signal modulation or 
a superposed noise transient). What is the audio output voltage that re- 
sults? 

FREQUENCY DETECTOR 

Except for the greater bandwidth, the amplifiers and selective circuits 
between the antenna and the limiter of an FM receiver are similar to those 
of an AM receiver in their transmission features. If the selective circuits 
have a bandwidth ample to accommodate the maximum frequency swing of 
the FM transmitter, and if the transmission over the band is substantially 
“flat”? and the phase shift nearly linear with frequency, the amplifiers will 
introduce little distortion. The limiter and frequency detector are therefore 
regarded as the distinctive elements of an FM receiver meriting theoretical 
discussion. 

The literature contains descriptions of frequency detectors of several 
types together with adequate analyses of the action of the circuits based on 
the variable impedance concept.' The more generally used circuits can be 


1See Items 3 to 6 in list of references attached. 
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reduced to the circuit shown schematically in Fig. 1, which can be taken to 
illustrate a generic form of frequency detector. 72; and Z, are two resonant 
impedances tuned to different frequencies, one above, the other below, the 
carrier frequency.” For example, the simplest version of Z; and Z2 could be, 
for each, a parallel combination of R, 1 and C. Across each of these 
impedances is connected a rectifier with load circuit so proportioned that 
the rectification is substantially linear. The rectifiers are poled so that their 
low-frequency outputs are opposed, thereby obtaining cancellation of even- 
order demodulation products. With this arrangement, the low-frequency 
output voltage V,, which is applied to the audio amplifier, is substantially 
proportional to the difference between the envelopes of the voltage drops 
across Z; and 2». 

The resistance elements of the impedances, 7; and Z2, each include a shunt 
ing resistance equal to half the load resistance of the associated rectifier, 
which therefore determines, to some extent, the ( of the tuned circuit. The 
output diode load, R, Co, has negligible impedance at the carrier frequency. 
Under these conditions, the low-frequency output voltage across the two- 


rectifier load impedances is 


where » = detection efficiency (nearly unity) 
Vi, Ve = high-frequency voltages across 7, Z:, respectively (Fig. 1 


[V] = envelope of V. 
All this is in accord with the accepted understanding of the operation of a 
properly designed linear rectifier working at an efficiency approaching 100 
per cent. 

The amplitudes of the voltages, V; and V2, across the resonant imped- 
ances, Z, and Z, , of Fig. 1 are shown in Fig. 2. In the practical engineering 
analysis of this frequency detector circuit, employing the idea of impedance 
that varies in step with the instantaneous frequency, the two voltages of 
Fig. 2 are subtracted (owing to the opposed polarities of the rectifiers) to 
obtain the over-all voltage-frequency characteristic shown in Fig. 3. Then 
it is inferred, by physical intuition, that if the instantaneous frequency of 
the carrier is varied at the input, the output voltage wave will vary as in- 
dicated by the curve of Fig. 3. Strictly speaking, this is a false assumption, 
but where the rate of variation of the instantaneous frequency is at an audio 
signal frequency far below the carrier frequency, the error in the assump- 
tion is of no importance, whereas ihe simplification in thinking accomplished 

2 The term carrier frequency will be used to designate the value of the unmodulated 
received frequency after all heterodyne conversions. (This frequency is equal to the mid 
band frequency of the last intermediate frequency amplifier ahead of the limiter, if tuning 
is perfect.) 
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by it is considerable. It is only where the rate of variation of the insta 
taneous frequency is high, as it can be in the case of a large noise transie 
caused by impulse excitation, that the error in the assumption in questi 
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Fig. 2—Voltages across tuned circuits of frequency detector 
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Fig. 3—Output voltage of frequency detector 


can become serious. A particularly subtle error that can arise from the 
assumption is to fall into the habit of regarding the characteristic curve of 
Fig. 3 as a frequency transmission curve of the sort obtained by measuring 
the ratio of output to input of a linear network over a range of frequencies. 











TRANSIENT RESPONSE OF AN FM RECEIVER 717 
































~ The curve of Fig. 3 is not such a transmission curve, because the principle of 
_ superposition does not apply and a frequency conversion is involved 
oe Owing to the considerations discussed above, the analysis to follow avoids 
the assumption of variable impedance associated with the varying 
instantaneous frequency. This does not imply that the assumption, as 
employed by various writers, is considered seriously erroneous, but, rather, 
that it seems preferable to develop the theory without invoking the as 
sumption, provided that this can be done without falling into unmanageable 
complications. Briefly, the procedure in the work to follow is to determine 
directly the envelopes of the voltages V; and V. as functions of time, one 
envelope then being subtracted from the other to obtain the output wave 
2 Zn , 
to o 
z —_ oll . is 
© 7h is 
a7 4 | 
< os ~ 
Fig. 4—Current wave out of limiter 
GENERAL THEORY 
When a carrier is being received, the limiter can be regarded as substanti- 
ally a constant current source having an internal shunt admittance small 
compared to the admittance of the tuned impedance elements, Z; and 22, 
of the frequency detector. If the limiting is severe, as it should be for good 
operation, the current delivered by the limiter is a rectangular wave as 
illustrated in Fig. 4. When this current is driven through the impedances, 
Z, and Z,, the voltage drops, V; and V2, that arise across these elements 
are substantially sinusoidal in form, owing to the selectivity, which practi- 
cally extinguishes all harmonics of the carrier frequency. We therefore take 
the current input to be sinusoidal in the first place, namely 
I(t) = hcos 2rfol 
This is the unmodulated current, 7h/4 being the current cutoff point of the 
limiter and f, the frequency of the carrier. When the carrier is modulated 
in frequency, we write 
I(t) = hos [2rfol + @(t)] (1) 
where @(/) is the phase angle varying with time. ‘The instantaneous fre- 
quency then is 
i f(t) = Fs ! [2rfot + O(t)) = fo + . 6’(t). (2) 
; 2m dt ar 
: 
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In the transmission of signals by frequency modulation, the instantane 
radian frequency deviation, 6’(/), is made to vary in proportion to t 
signal amplitude, so that @(/) then varies in proportion to the time integ: 
of the signal amplitude. 

As a preliminary step to the discussion of the frequency detector itse 
we require a formula for the voltage drop across an impedance Z(f) when ti 
frequency-modulated current (1) flows through it. The point of vic 
usually adopted is to regard the impedance as a composite function of tim¢ 
viz., Z| f(/)|, and to say that the voltage across it is 


VU) = MOZLAD)) = MHZ li. + . wi]. (3 
¥ 2 


S 


This quasi-stationary viewpoint gives results that are nearly correct if the 
| 3 o 


rate of change, 0”(¢)/ 2m, of the variable frequency is not too large. The 
magnitude of the error has been determined in a paper by Carson and Fry. 
In the present paper, impedance is a function of frequency that is inde 
pendent of time, as in the classic theory of linear systems. The frequent 
use of the term “instantaneous frequency,” as detined by (2), does not 
imply a departure from this point of view. 

In the following, /(/) is, in general, the voltage response as a function 
of time, of a network to a unit impulse of current applied at time / = 0. 
In the case of a two-terminal impedance element, //(¢) is the voltage drop 
across the element when a unit impulse of current is sent through it. Then, 
if the frequency modulated current (1) flow through the impedance, the 
voltage drop is 

a 


Vib) =e | cos [wolf — 7) + O(t — r)|H(r) dr (4) 
( 


) 


where w) = 27fy. 6(t) can have any form as a function of time. V(t) 
can be written, 


: / : t twyr? 1h \ T) 
V(t) = Seal | ae H(r) dr 


} t " iwnar— 10 rT) f 
1 1 eo iwo | eit '” H(r) dr. ( 
2 0 


wn 


In the frequency detector problem, the result finally desired is the envelope 
of the voltage wave. It will clarify the discussion to explain first what is 


meant by an envelope. If the voltage is of the form 


V(t) = c(t) cos [wot + o(2)] 


= a(t) COS wol — B(Z) sin wol 
= }[a(t) + ib(d)] e*" + 3fa(t) — ib(d)] e (6) 


> Item 1 in the bibliography. See formula 21 in that paper. 
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the complex function, 


(V(t)] = c(t) ee? = a(t) + id(t) j 


is here called the “envelope function”’ of the voltage with respect to the 
radian frequency wo , c(/) being a real amplitude modulation factor, which is 
the envelope* itself, as usually conceived, and exp {i@()] a complex fre 
quency modulation factor, in which ¢’(/) is the instantaneous deviation of 
the radian frequency from the reference value, w. If such a modulated 
voltage wave is applied to an ideal linear detector, the output voltage across 
the load circuit of the latter is the real envelope, c(/) = {a*(¢) + 6 
This concept of an envelope function provides a convenient ceneralization 
of modulation ideas. Both amplitude modulation and frequency modula- 
tion vary the envelope function, but in different ways. In amplitude mod 
ulation, the real maznitude, c(/), is varied while the angle @ is constant, 
whereas, in frequency modulation, ¢ is constant and it is the angle, ¢(/), that 
is varied. 

It will be seen that (5) is in precisely the same form as (6), so that we can 
write the envelope function of V(¢) immediately, as follows: 

= 


[V(i)] = a(t) + ib(t) = h | eo wortMt—) FIL) dr. § 
“0 
The conjugate envelope function then is 


[Vip] = a(t) — ib) =] ee AC) dr. (9 
“0 
The spectrum of the envelope function is also of interest. To obtain the 
spectrum, which we shall call, /o(/), we find the Fourier transform (hereafter 
abbreviated, F.T.) of both sides of (8), viz.: 


a a © 


Fi(f) = | [Vit)le “dt = h ” as e wortiMt—r) m7) de dt. (10 


It is permissible to reverse the order of integration of 7 and /, obtaining 


Fo(f) =h | e “°" H(t) | eo ae. 11 
\ /0 0 
The F.T. of # exp [7@(¢)] will be designated, ¥(/), i.e. 


x 


W(f) = i | ae 12 


wv © 


‘ The ‘‘envelope’’, so defined, is an engineering concept and is not quite the same thing 
as the envelope of mathematics, which is always tangent to a curve or set of curves 
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Putting ¢ — + in place of / in place of ¢ as the variable of integration in (1 
we have 


a 2 
VI f) = he" | e twt+140(t—r) dt. (1 
— 2 


hus it is seen that the inner integral of (11) is equal toe “’W(/) and t! 
equation becomes 


»® 


Ff) = ¥(f) | W(r)e****™ dr. (14 
“0 
Now the F.T. of H(t) is Z(f), i.e. 
zin= | HWwe* dt. (15 
4 P 
Therefore 
Z(f + fo) = i H(t)e ***** dt. (16) 


his differs from the integral in (14) only in the lower limit of integration. 
But since //(¢) is the response to an impulse applied at time ¢ = 0, H(t) = 
0 for / < Oand the two integrals are therefore equal. Putting (16) in (14) 
we have, finally 


Fo(f) | [a(t) + ib(t)Je ““' dt = V(PZ(f + fo). (17) 


The F.T. of the conjugate envelope function, a — 18, is 
a 


Fy(—f) ! [a(t) — ibe “‘ dt = ¥(—f)Z(—f + fo) (18) 


where symbols with the superbar denote the complex conjugates of unbarred 
symbols. Since Z(/) is the F.T. of a real variable, //(¢), it must assume con- 
jugate values for positive and negative values of f, ie., Z(f) = Z(—/f) and 
therefore Z(f + fo) = Z(— f + fo). Consequently, (18) could be written 
Fo(—f) = ¥(—fZ(f + fo) (19) 
(17) and (18) are the final solutions in frequency functions corresponding 
to the solutions (8) and (9) in time functions. The formulas in frequency 
functions have the advantage of compactness, which makes them easy to 
remember. 
We require also the F.T. of the voltage itself, i (4), which we shall call 
F(f). From (6) 


- a 


F(f) = | Vie “* dt =} | (a + ibje “°°? "dt 
” 2 


+ : [ (a — ibe ***#0 gt, (20) 
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and from (17) and (18) this evidently is 


F(f) = 4W(f — fo)Z(f) + 3%(-f — fo) Z(—f) 21) 
or, since Z(f) = Z(—f) 
F(f) = 4Z(f) [WU — fo) + ¥(—-f — fod). 22) 


Anyone familiar with the rules of Fourier transforms could write down this 
frequency function in the first place and then proceed to find the time fun 
tions by the reverse of the process just carried out. But the time functions 
are more closely related to the physics of the problem and therefore provide 
a more fundamental starting point for its solution. 

It will be appreciated that, although the above discussion has been phrased 
to apply to the problem of finding the voltage drop across an impedance 
when a frequency modulated current flows through it, the formulas also give 
the current through an admittance when a frequency-modulated voltage is 
applied across it. They also give the output voltage or current of a four- 
terminal network when a frequency-modulated current or voltage is applied 
at the input. These various applications of the formulas obviously can be 
made by placing definitions on Z and H appropriate to the particular 
problem. 

The next step is to assume suitable values of the impedance Z(/) and vart- 
ous forms of the frequency modulation function @ and to employ these 
particular values in the general formulas 8, 9, 17 and 18. 


BALANCED FREQUENCY DETECTOR 

The impedance can be defined, in general, as a rational algebraic function, 
viz.: 

(iw — a)(iw — de) --+ (tw — dm) P(iw) 


(iw — pr)(tw — po) +++ (iw — pp) zi O(iw) © 


Z(iw) (23) 
Writing the polynominals P and Q in this way as the products of their factors 
exhibits the a’s as the zeros of Z and the p’s as the poles. The latter deter- 
mine the frequencies of free vibration of the network. For the network to 
be stable, the p’s must all have negative real parts. The a’s and p’s are 
either real or occur in conjugate complex pairs. By the partial fraction 
rule, the expression can be broken up into a series of simple fractions; thus 
1, Ay 


Z(w) = ' + ~ 4+ +--+, (24) 
lw — py lw — po 


If the poles are all simple, the A’s are given by 
_ P(b;) 
Q(B) 


Nm 
wn 
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For the present purpose, only a pair of terms of (24) need be conside 
This will provide a specific solution of the balanced frequency detector for 
case, previously used for illustration, where the impedances are two sin 
resonant circuits of parallel R, 1 and C. At the same time, this solut 
can be extended to more complicated circuits by superposing a number 
such elementary solutions, as is clearly possible with the type of impedai 
development indicated by (24). 


The impedance of R, / and C in parallel, written in the form of (23) a: 


(24), is 
. iw 1 k k 
Z\tw) = - : + 
C (tw fi)(tw ps) lw fi lw — pr» 
(6 
k k 
: wry wt ¥ 
where 
pb=y=at ip 
pr = ¥y = a us 
F o(1 2 =) 
2 83 
P : (1 4 =) 
2 3 
se 1 3 _ 1 1 = 2 2 = I 
OS ie ae ee, ee ee 


The voltage response of the circuit to a unit impulse of current is then 


stags 4 k iwt yt Ft ~ 
H(t) = (, : + r _) ¢ Gf = he” ke. 68S O «6 007) 
Jo \iw + ¥ iw + ¥ 


To find the envelope function of the voltage drop when the frequency 
modulated current 


T(t) 4 (ermot rate eM e twot 10(t)) (1) 


is applied to the circuit, we make use of the general formula, (17), which 


states that the spectrum, or Fourier transform, of this envelope function is 


Fi(f) = [ [a(t) + ib(t)le “dt = W(f)Z(f + fo) (17) 


a 
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where W(f) is the F.T. of 4 exp [7 6(/)] and Z(/) is the impedance, (26), in this 
case. The envelope function then is 


a(t) + ib(t [ WiAZ(t + ie dt 


ot 
hig er | ? ali biol aals = 28 
+ hke “et?! eres ee 


Phis result is obtained by employing the convolution formula’®, which states 
that if F; and Fy are F.T.’s of Gi and Gy» , respectively, then the F.T. of F\F2 is 
0 a 
Fi(f)F2(fye df = ((7)S2(t — 7) dr. (29 
; | 
(The upper limit of the intezrals in (28) is /instead of « for the reason that 
H(t) is zero for ¢ < 0.) The result could have been obtained equally well 
without using the Fourier transforms by substituting (27) in (8). When 
A(t) is specified mathematically in the infinite interval (—«, «) these 
formulas give the resultant of the steady state and transient oscillations 
Various problems can be solved by specifying particular forms of variation 
for A(t). Two examples follow: (1) where the instantaneous frequency, 
6’(/), is an impulse and (2), where 6’(/) is a sinusoidal wave, as for elemen- 
tary signal transmission. 


Example 1: Impulse Modulation 


Ignition interference comprises a sequence of sharp impulses, each of du- 
ration very brief compared to the interval between them, so that the tran- 
sient in the receiver produced by one impulse dies away before the next one 
arrives. It is therefore sufficient to consider the disturbance caused by a 
single impulse. 

If the receiver is perfectly tuned, an impulse produces, in the tuned cir- 
cuits, a transient of the same nominal frequency® as the signal carrier. 
When superposed on the carrier, the interfering transient alters, or mod- 
ulates, both the amplitude and phase of the carrier. The amplitude mod- 
ulation is wiped out by the limiter, but the phase modulation remains to 
produce noise in the output. The phase shift caused by the transient is a 
random variable, because it depends upon the time of arrival of the impulse, 
and this is entirely fortuitous. 

5 See pair 202 of Item 10 in the bibliography. 

§ By ‘‘nominal frequency”’ is meant the frequency as determined by counting zeros of 


the wave. The transient actually comprises a spectrum of frequencies spread over the 
band of the tuned circuits, of course 
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It is of engineering interest to determine the noise produced by a ve 
large impulse, exceeding greatly the amplitude of the siznal carrier. Wh 
such a large impulse arrives, it causes a sudden jump, or discontinuity, 
the phase of the carrier. The excursion of the instantaneous frequen 
corresponding to the phase jump is indefinitely large and the problem a 
cordingly cannot be solved satis‘actorily by means of the usual assumpti: 
of quasi-stationary frequency. The problem of large impulsive interferen 
provides, therefore, the principal justification for the more exact method | 
analysis here employed. In the paragraph following, the problem is r¢ 
stated in terms providing a suitable basis for mathematical analysis. 

We assume, as before, that the limiter is delivering to the frequency « 
tector a steady carrier current of constant amplitude /# and frequency /, 
At time ¢ = 0 a brief disturbance occurs specified by the statement that 
the instantaneous frequency, 6’(t), of the current suddenly executes a1 
impulse of moment 6. That is: 6’(/) is zero at all times except at ¢ = 0, 
when it goes to infinity and back to zero again in such a way that the area 
of the impulse so formed is 8. The carrier current amplitude then remains 
constant but the phase, 6(/), of the carrier takes a sudden jump of © radians 
at / = 0. What is the voltage output of the frequency detector? 

The general formula (28) gives directly the envelope function of the volt- 
age across the impedance (26) for a phase function @(/). In this formula we 
have now to put 6(/) = 0 before time /= Oand 0, after¢ = 0. We do this by 
dividing the interval of integration into two parts, (— «, 0) and (0, ¢); thus 


al 


t 
a(t) + ib(t) = hke “OT”! (| fer" ee” | ” iit ir) 
- 2 0 


el 


/ 2 
wotF)t wotd)r 0 (iwo +5) 
+ kke “*"" (| e* dr + e' eo ir) 
-2 “0 / 


) 10 i0 


le twoty)t ' ; 1 —" ~~ lwoty)t Fg 
: = + hk ll + :. (3p) 


10% + 7 1a + 7 


it 
(l—e 
=-hk 
Let the radian frequency interval by which the applied frequency wy 
is set off from the resonant frequency w,. be 
A = a — WwW (31) 
as indicated on the curves of Fig. 2. Whena/w, is small compared to unity, 
as it is in practical circuits, 8 is very nearly equal to w.. (See the formulas 
following equations (26).) Then 
iwo + Y = two +a + iw, = a — iA + iw 


and 


two + Y = tw ta — iw = at id. (32) 
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sum ] 
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From this it is evident that the first term of (30) contains the demodulation 


sum product of frequency on the order of 2w). This frequency will be 
suppressed by the diode load circuit and consequently the second term of 
30) is an adequate representation of the envelope function. Therefore we 
write 
a(t) + 1b(¢ hk sacl it 33 
lw rT ¥ 


a(t) + 7b(t ws , 34 


One deduction that can be made immediately from this formula is that the 
frequency of the oscillation in the output of the rectifier caused by the phase 
jump at the input is A, the radian frequency interval by which the applied 
carrier frequency differs from the resonant frequency. The oscillation is 
heavily damped, however, because a, while being very small compared to 
w,, is comparable in magnitude with A in circuits commonly used. 

The angle of the complex envelope function (34) represents merely) 
a phase shift of the carrier frequency w,. We are interested only in the 
magnitude of the function, viz.: 


wn 
an 


c(t) = [ar(t) + BD] 


After some algebraic work, the desired formula comes out of (34) in the 
following form: 


i 5) Bude 
h F 2n:(t) sin | At + + m (t) 
cll) = : 2 a a | 


where 
m(1) Je * sin (36) 


The discussion so far has dealt with a single impedance (or network) and has 
been concerned with obtaining formulas for the voltage across the imped 
ance, and the envelope thereof, when a frequency-modulated current is sent 
through it. It is neeessary now to refer to the construction of the balanced 
frequency detector, which is the particular object of our study. Figure 1 
shows two impedances having the variation with frequency sketched in 
Fig. 2. The carrier current is driven through the two impedances in series 
and linear rectifiers re connected across each in such polarity that their 
low-frequency output voltages are opposed. We assume that the output 
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voltage of each rectifier is the envelope of the voltage existing across i 
associated impedance. Therefore, to find the total output of the balance 
frequency detector, we have to find the difference between the envelopes 
these voltages. 

It is necessary to specify the two impedances more precisely. It appear 
that the best operation is obtained if the frequency of the carrier is midway) 
between the resonant frequencies of the two impedances. That is 


A — @o- @1= @~— Wo 


where w, , w, are the resonant frequencies of Z; , Z2 , (previously written as 
w., for any impedance). Furthermore it appears that the two impedances 


should have identical values of C and very nearly the same damping con 


stants. The design of the detector circuit is accordingly specified by 
C= &=C 
R,= Rk: =R 
Ly = 1/aiC 
ba =] wot 


and then 


(a Wo K 1) 


Ly We -~ Wy + A 


All the quantities are assumed to be substantially constant over the signiti- 
cant frequency range. 


With the circuit constants so proportioned, it can be seen from (36) that 
the envelope of the voltage across Z, differs from that across Z2 only in the 
signof A. Therefore, the output voltage of the balanced frequency detector, 


when the instantaneous frequency variation is an impulse of moment 0 at 
t= 0, is 


Vo(t) . C(t) ss (1) 


.% — 0 — 
C (a +A) st ({ + 2»: sin (a — ) + nT 
6 1/2 (38) 
al E — 2m sin (ai 4 * + mi | ), i>@ 


to 


0 
_ 


- 


» t ® 
= (), ! <0, where m = 2e “™ sin 
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On Fig. 5 is given a plot of this function for a value a, A = 1 of the rela- 
tivedamping. Calculations for other values of a A show that the output os 





cillates only weakly for a/A = } and is nearly dead-beat fora A = 2 

1.0 

ae Al or 
V(t) = === F(t OLTS 
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Fig. 5—Transient response of a balanced frequency detector for a step 0 in the carrier 
phase 
Example 2: Signal Reception 
If the carrier is frequency-modulated by a signal 


s(t) = S cos gt (39) 


and the frequency modulation factor of the transmitter is u, the phase of the 
carrier wave is made to vary in accordance with the relation 

es ws. ‘ 

A(t) = w | s(t) dt = sin gt = x sin gl 1()) 

pn gq 
wS is then the radian frequency deviation of the transmitter and wS/4q, 
which is the ratio of this frequency deviation to the frequency of the signal, 
is commonly referred to as the “frequency deviation ratio.” This factor, 
which is denoted by x in the above equation, enters as a fundamental 
parameter in all FM theory. 

To find the envelope function of the voltage wave produced across the 
impedance (26), when the frequency-modulated carrier is received at the 
frequency detector, we put (40) in (28). To effect the integration, the 
expansion 


2 


grein qt = bs T,(xyen™ rr ) 


N=—20 
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is used, /,,(x) being the Bessel coefficient of the first kind, of the nth order a 
with the argument x. For brevity, /,(x) will be written J,. Also, t 
first term of (28) is to be omitted, because, as was shown in the precedii 
example, it represents frequency sum terms which are filtered out by t 


diode output circuit. Then we have 
a(t) + ib(t) = hke ‘““*°*7 ‘ eens > DP wes |S (42 
‘a , 


When the integration is carried out, the result is 
~ ta rt iit 
a(t) + 1b(0) hk ym —— = hk =. —. (43 
n=—wo lWo T Y T Ing nm—o @ - 1A - Ing 
lo obtain the magnitude of a + 7b, which is the envelope required, we multi 
ply the above Fourier series for a + 76 by that for a — 1b, obtaining a double 


summation, which can be written as follows: 


2 
C(t) a(t) n b(t) bye > (ene + Ge inaty 
n=1 


(44) 
a) 
a + 2 > dn cos ngl 
eal 
where a, is the real part of the complex coefficient c, and dp = ¢o. 
The coefficients are given by 
he <. Cone paren 
AC? maa (a — iA — img)[a + id + i(m + n)q] 
(45) 
ht = Su x +n 
Cy = 4 "cae —RaRC ; Ts 
AC? moe (a + iA + img)[a — iA — i(m + n)q]) 
Obtaining this result involves use of the relation 
J_.(x) = (—)" J, (x). (46) 
Krom (45) 
WS Im Imani’ + (A + mg)(A + mg + nq)| (47) 
ad, +/ 


4C? moe la? + (A + mq)*I[a2 + (A + mg + ng)*] 

Finally, we have to obtain, as before, the difference between the en- 
velopes of the voltages across the two impedances of the balanced frequency 
detector; that is, we have to determine 


V(t) = a(t) — a(t) (48) 


where c:(/) is given by (44) as it stands and @(¢) is obtained from the same 


expression merely by reversing the sign of A. The complete solution for 
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the output voltage of the frequency detector, when a phase variation, 
6’(t) = xg cos gt, is impressed on the carrier at the input, then is 


© 1/2 2 1/2 
Vo(t) = E + 2 > Gn2 COS nat | — E ss ae x dy. COS nat | 49) 
n=l 


bo 


n=l 


where d,; is given by (47) and a, is given by the same formula with the 
sign of A reversed. 

An approximation that is permissible when the frequency swing xy does 
not approach the available frequency range A is 


2 


V(t) = bi A,, cos ngt (50 


n=l 


where 
A, = = , n odd: 0, m even. (351) 


The table following gives the results of a computation of the first four 
coefficients from formulas (45) and (51) for the case of a frequency deviation 
ratio, x = 5, for g/2x = 2000 cycles per second, for A/2r = 30,000 cycles 
per second and fora/A = }, land 2. 


Coefficient af/A = 5 1.0 2.0 
Ao 0 0 0 
A, -848 478 191 
Ao 0 0 0 
As .0312 00438 ~ 00281 
Rhe 


Note: To obtain volts, multiply all values by Ewe 
a? + 42)! 

The coefficients for even values of n vanish, which confirms what can be 
inferred from physical considerations, namely, that the balanced construc- 
tion of the frequency detector eliminates the d.c. component and all even 
harmonics. From the ratio of A3 to A; we obtain the following ratios, ex- 
pressed in db’s, of the third harmonic distortion to the fundamental signal 
for the three circuit designs: 


a/A 20 log 10} A3/A, 
5 —28.7 db 
1.0 —40.8 
2.0 — 36.6 


The results for the sinusoidal signal, when considered in conjunction with 
those for the impulse modulation, also permit certain, conclusions regarding 
signal-to-noise ratios for impulsive interference in FM reception. 
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Ratio of Noise to Signal 


It may be helpful, in conclusion, to attempt a theoretical estimate of the 
ratio of noise to signal in the audio output under the condition of severe 
impulsive interference. 

rhe ratio of the peak value of the pulse to the signal amplitude at the 
frequency detector output is given by the ratio of the peak values of /(?), 
as plotted in figure 5, to the values of A; in the table above. To obtain 
a result of practical significance, however, the effect of the audio circuit 
should be taken into account. In the absence of specific information on 
the structure of this circuit, we assume that the peak value of a pulse at 
its output is equal to the area, or moment, of the pulse at its input times 
twice the audio cutoff frequency. This is true for an ideal ‘square cutoff” 
filter and not seriously in error for actual circuits. The area of the largest 
pulse at the frequency detector output is approximately 2A/(a° + A*) and 
the value of Aj, the signal fundamental amplitude, can be approximated by 


2A xq (en 
Ay a (52) 
a? + A? 
(For the example above, this approximation gives A; = .8, .5 and .2 as 


compared to the exact values, .848, .478 and .191.) In this way we arrive 
at the following estimate of the peak ratio of noise to signal in the audio 
output: 

Max. value of largest pulse _ Wa (53) 

Signal amplitude xg e 

where w, is the cutoff-frequency of the audio circuit, g the signal frequency 
and x the frequency deviation ratio. Then xq is the “frequency swing”’ of 
the transmitter, i. e., the maximum departure of the instantaneous frequen- 
cy from its mean value. It is to be noted that this formula is free from the 
detector circuit parameters, a, A, R, and indicates that, to a first approxi- 
mation, at least, the maximum ratio of noise to signal depends only upon 
the audio circuit cutoff frequency and the FM swing. Furthermore, this 
establishes a ceiling for the interference that will not be exceeded no mat- 
ter how large the impulses may be. 
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Transverse Fields in Traveling-Wave Tubes 


By J. R. PIERCE 


Traveling-wave tubes will have gain even if the r-f field at the mean position 
of the electron stream is purely transverse. The addition of a longitudinal mag 
netic focusing field reduces the gain due to transverse fields and increases the 
electron velocity for optimum gain 


LL slow electromagnetic waves have both longitudinal and transverse 


electric field components. Sometimes either the longitudinal or the 
transverse field may go to zero along a line or plane parallel to the direction 
of propagation. For instance, for the slow mode of propagation there is no 
transverse field on the axis of a helically-conducting sheet. Still, over any 
plane normal to the direction of propagation there are bound to be both 
longitudinal and transverse field components. 

If a very strong longitudinal magnetic field is used in connection with a 
traveling-wave tube, the transverse motions of electrons may be so restricted 
as to be of little importance. With weak focusing fields, however, the trans 
verse motion of electrons may be important in producing gain. The trans 
verse fields can force the electrons sidewise, and thus change the 
longitudinal fields acting on them in such a way as to abstract energy from 
the electron stream.' This is closely analagous to the action of the longi- 
tudinal fields in displacing electrons forward or backward into regions of 
greater or lesser longitudinal field. 

The purpose of this paper is to analyze the behavior of traveling-wave 
tubes in which transverse fields are important. The attack will be similar 
to that used previously.” 


1. Crrcurr THEORY 
In this paper we shall consider only the electric field associated with the 
slow mode of propagation along the circuit having a speed close to the elec- 
tron speed, and we shall neglect other field components attributable to local 
space charge. The writer believes the results so obtained to be valid at low 
currents but in error at high currents, and an acceptable guide at currents 
usually encountered. 
In an earlier paper’ a relation was found between the longitudinal field £: 
excited in a mode of propagation of a transmission system and the longitud- 
See, for instance, J. R. Pierce and W. G. Shepherd, “Reflex Oscillators,” B.S. T. J., 
Vol. 26, No. 3, pp. 666-670 (July, 1947). 
* J. R. Pierce, “Theory of the Beam-Type Traveling-Wave Tube,” Proc. I. R. ., Vol. 


35, pp. 111-123, Feb. 1947. 
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TRAVELING-WAVE TUBES 


inal exciting current g. Both #,and gq vary as (exp jw!) (exp — Tz). The 
relation is 
Io 


oe SOF — Fe 


Here Ip is the propagation constant of the transmission mode considered and 
is defined in such a sense that for unattenuated propagation, 9 = 735 where 
8) is a positive number. The quantity Yo is defined as 


2P 


Ww" pe 


Here P is complex power transmitted by the mode and £, is the field asso 
ciated with the mode. 
In generalizing (1), let us consider the combination of equations (1) and 
(2) 
* I"o 


*T’ —T9 


P* = \gE (3) 
Now, suppose there is motion of the electrons not only in the z direction but 
in a direction normal to the z direction, which we will call the y direction. 
We shall have two extra first-order terms of the same general nature as 
gE?, which contribute to the power, giving 
a4 * ’ 
p* = : (« EP + (—h)y = + gy, #3) % ee a, (4) 
Z Oy s = Fe 

Here q; is the a-c convection current in the z direction, —Jo is the d-c con- 
vection current in the z direction (assumed to be the only d-c convection 
current), y is a small displacement, g, is the convection current in the y 
direction and £, is the field in the y direction. 

We will now specialize this expression. Suppose we consider a two-dimen- 
sional transverse magnetic wave propagating in the z direction with a phase 
velocity v such that v < c. Then over a restricted region the electric 
field can be represented quite accurately as the gradient of a scaler potential 
of 


V = exp (—I's)(A exp (jl'y) + B exp (—jT'y)). (5) 


Here A and B are constants. Using our notation, in which the field is 
understood to include the factor exp (—Tz), we obtain 


E, = VT (A exp (jl y) + B exp (—jI y)) (6) 

7D ay? , » 

= * = jI°(A exp (gly) — B exp (—iTy)) (7) 
y 


E, = —jY(A exp (jy) — B exp (—jI y)). (8) 
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In other words 


OE . 
- ie: ae Ty! oe (9 
OV 
his relation will also be approximately correct remote from the axis in an 
axially symmetrical tube. Here we let y represent a displacement in the 7 
direction. 
We may also define a quantity @ so that 
E, = jak, (10 
-(A exp ((T'y) — B exp (—jIy)) (1 
Ayexp (jl',) + Bexp(—jI'y) ~ 
lor an active mode, such as the one we consider, the chief component of 
jis a positive real number. Hence, for large positive values of y, the 
quantity @ approaches a value 
a=1. (12 
This is characteristic of a plane symmetrical field far from the axis and also 
of an axially symmetrical field far from the axis. 
Using (9) and (12) we rewrite (4) 
* ern : 
p* = LE, (g- an Jar I *Toy ‘a qy)| (13) 
We see from this that, according to our assumptions, for the mode considered, 
I'o 
Gy}) 7 #702 y2\ * 
be! per — Ti) 


We will henceforward assume that @ and y are so nearly real that we can 


E, = (gz — ja*(T* Inv + (14) 


regard them as real quantities, giving ¢ 
. ° Swes \ 0 =-\ 
E, = |g, — jal(T* lov + gy)] (15) 


y(I" — To)’ 
This is, then, the circu't equation which we will use. 
2. ELECTRONICS EQUATIONS 
We will assume an unperturbed motion of velocity # in the s direction, 
parallel toa uniform magnetic field of strength B. Products of a-c quantities 
will be neglected. 
In the x direction, perpendicular to the y and ¢ direction 


dx 
dt 


Assume that 4 = Oat y= 0. Then 


—nBy. (16) 


x= —nBy. (17) 
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LUBLS 


In the y direction we have 


d yV 


- = (Be — jak,) (18) 
al 
Now 
ae _ oy 4. oy ds (19) 
dl al as dl 
j 
ly 
ae u(78 — T)y (20) 
dt F 
im, (21) 
My 


We obtain from (20) and (18), and (17) 


' iii, 2 mak, 
(78 —T)y = —mBoy — = 22) 
Mo 
3 
few SE (23) 
Uo 
# We may note that 7B is the electron cyclotron frequency. Now, 
‘ ov Ov dz 
Vy = —— 4 - 

at dz dt (24) 

vy = m(78 — T)y. 

From (24) and (22) we obtain 
= nak, e 
V 2r7 - ‘ 2 2 25) 
: wol(78 —T) + Bol 
—jna( 78 — V)E, 
"—— 2) - (26) 
Uo\( 78 <i I ) + Dol 
We will have for g, 
' 
v “ 
qy = —Io : (27) 
iy 
jnal,(j8 — V)E, 

‘ . ® ) 


Yy 2 . 1\2 2): 
wv = W{(ja — TY + 6) 


It is easily shown that 


nk, 
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If po is the d-c linear charge density and p the a-c linear 


charge density é 
I 
P= = a (30 
Ug 
If g, is the z component of convention current, we have 
ie = poz + Uo p 
Iy2 (SI 
= ——" + nop. 
Mo 
We also have 
Ods _ _ Op 
I'gs = jwp. 
From (31) and (32) we obtain 
— 7812 
5 gee (33 
(j8 — T) 
Thus i 
_ _ jnBIoE, - ! 
Qz — uo( j8 a tr)’ Jt) 


3. COMBINED EQUATION 


Combining (34), (28) and (25) with (15), we obtain 


nly Io We) a(T'* a (78 = t")) = 
uw wi —T)LG8-T) IGe—Tr) +s] [°° 


We now introduce new parameters 


(36) 


1 _ EE: 
By, 28?P 


ee | KI ham 
= ° Ee f] 
. (; 2) : 


Here Po is the power transmitted by the circuit for a field strength E,. K 
has the dimensions of impedance. Vo is the voltage specifying the electron 
velocity uo 


tuo = 2nVo. (38 


From (36)—(38) and (35) we see 





282C3 To | jb a(1'* = (78 _ 3 | 
(Gg—-T)Y” ([Ge—-—T) +i) | 


"<r" — T) 
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We now make the approximation that 


Pa —p+é (40) 
Where |5| <|8!. Neglecting higher order terms, 
ry | a? 
= 273 C E a | 41) 
I'y 6 0 + Po 


4. PURELY TRANSVERSE FIELD ALONG THE PATH 
We can imagine a case in which a approaches infinity and the quantity 


a... Hee te 


D’ = a ; (42) 
: BP 8V, 
remains finite. In this case we have 
r’—-ro  278°D* " 
_ = — 5 (43) 
To 0* + Do 
We will let 
—I'y5 = —j8 — 78Db — BDa. (44) 


Here d is a parameter describing the difference in speed between the electrons 
and the unperturbed wave and d is a loss parameter. 


Assuming 6D <1 and dD <1, and letting BD(« + jy) = 4, (45 
we find 
(x? — y*? + f*) (y + 5b) + 2xy(x + d) = —1 (40) 
(x? — y? + f*) (x2 + d) — J2xy(y + 56) = 0 (47) 
where 
Bo 
> = (48) 
I = 32 pp wi 


If would be difficult to work with all of the parameters b, fandd. How- 
ever, it scarcely seems that the attenuation parameter d should enter into 
any unusual phenomena due to the presence of the magnetic field. Accord 
ingly, let us investigate (46) and (47) ford = 0. We then obtain 

(sy + 6) + (f? — ¥*) (y+ 5b) = -1 (46a 
x(x? + (f? — y*) — 2y(y + 5)] = 0. (47a) 
From the « = 0 solution of (47a) we obtain 


x= 0 (49) 


(50) 
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If is found that this solution obtains for large and small values of 6. For 


very large and very small values of 6, either y = —6 (51) ory = +f (52 
The wave given by (50) is a circuit wave; that given by (51) represents the 
travel down the tube of electrons oscillating in the magnetic field with cyclo 
tron frequency. 
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Fig. 1Plot of parameters giving velocity and attenuation of the three forward waves 
vs. @ parameter 6 proportional to electron speed with respect to the undisturbed wave 
\ positive value of x means an increasing wave; a positive value of y means a wave travel 
ing faster than the electrons. This plot is for f? 0 (no magnetic field 


In an intermediate range of 6, we have from (47a) 


= tV2yW(y + 0) — (f? — 9°) (53 
and 
b= —ly+ Vf? — 1/2y. (54) 


For a given value of f° we can assume values of y and obtain values of 8. 
Phen, x can be obtained from (52) or (53). In Figs. 1-6, x and y are plotted 
vs. b for f? = 0, .5,1,2,4and10. It should be noted that 1; , the parameter 
expressing the rate of increase of the increasing wave, has a maximum at 
larger values of bas / is increased (as the magnetic focusing field is increased). 
Thus, for higher magnetic focusing fields the electrons must be shot into the 
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Fig. 2— Propagation parameters for f? = 0 
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Fig. 4—Propagation parameters for f? = 2.0 
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PROPORTIONAL TO MAGNETIC FIELD, f 


Parameter x giving rate of increase of increasing wave vs. f, which is propor 


tional to magnetic field strength. 


circuit faster to get optimum results than for low fields 
imum positive value of x is plotted vs. f. 


effect on gain of increasing the magnetic field. 


In Fig. 7, the max- 
The plot serves to illustrate the 
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Let us consider an example. Suppose 
\ = 7.5 cm 
D = 03 


These values are chosen because there is a longitudinal field tube which oper 
ates at 7.5 cm with a value of C (which corresponds to D) of about .03. Th 
table below shows the ratio of the maximum value of x; to the maximum 
value of x; for no magnetic field. 


Magnetic Field in Gauss 


r1/X 

0 0 | 
50 ei A i 
100 2.34 50 


A field of 50 to 100 gauss should be sufficient to give useful focusing action. 
Thus, it may be desirable to use magnetic focusing fields in deflection travel- 
ing wave tubes. This will be more especially true in low-voltage tubes, for 
which D may be expected to be higher than .03. 


5. Mrxep FIELDS 


In tubes designed for use with longitudinal fields, the transverse fields far 
off the axis approach in strength the longitudinal fields. The same is true 
of transverse field tubes far off the axis. Thus, it is of interest to consider 
equation (41) for cases in which q@ is neither very small nor very large, but 
rather is of the order of unity. 

If the magnetic field is very intense so that 8) is large, then the term con- 
taining a’, which represents the effect of transverse fields, will be very small 
and the tube will behave much as if the transverse fields were absent. 

Consideration of both terms presents considerable difficulty as (41) leads 
to 5 waves (5 values of 6) instead of 3. The writer has attacked the problem 
only for the special case of 6 = d = 0. In this case we obtain from (41) 


9 


ee? ie 2 
b= -#Cla + z— JI. 
eres e 


In work which is given as an appendix, Dr. L. A. MacColl has shown that 
the two “‘new” waves (waves introduced when a = 0) are unattenuated and 
thus unimportant and uninteresting (unless, as an off-chance, they have 
some drastic effect in fitting the boundary conditions). 

Proceeding from this information, we will find the change in 6 as 8% is in- 
creased from zero. From (51) we obtain 


, Ws 72 2 192 
ds = jC | - se ea | (56) 
. _ 68 (8% + 62)? (62 + 63)? 
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Now, if Bo = 0 


<3 
+93 73 0 ete’ 
—73°C° = : (5 
; (1 + a?) ‘ie 


Using this in connection with (56) for 8) = 0 we obtain 


) 2 
. “am 2 fe 

di = —— dpo. (58) 

350 

For the increasing wave 
6: = B(.866 — $5), (59) 
Hence, for this wave 
a ra . -, ABo 

dé; = 2(—.866 + 7.5) 7 (60) 


[his shows that applying a small magnetic field tends to decrease the gain. 
[his does not mean, however, that the gain with a longitudinal and a trans- 
verse field and a magnetic field is less than the gain with the longitudinal 
field alone. To see this, we can assume that not 8» but a’ is small. Differ- 
entiating (55) we obtain 


inte al — 2dé 2a°6d6 da? 
i al ab 6 (FP +B) (+ |: — 
Ifa = 0 
—jp Cc = 6 (62) 
1 6° da? 
dé = 52 (63) 


3 (6 + Bo)’ 
2. - . . ~~ 4 
If 8p is zero, a small transverse field (small increase in a) increases the magni- 

tude of 6 without changing the phase angle. If 8) > 6. , then 

, iB C . 2 
dé = aa da (4) 
B85 

and the change in 6 is purely imaginary. For the increasing wave, the change 
iné asa transverse field is added will range from an increase in the real part 
for smalljmagnetic fields to no change in the real part for large magnetic 


fields. 
APPENDIX 
Stupy OF THE ALGEBRAIC EQUATION 
, oes a 
6 = —76 ¢ — =f ——___. (1A 
i - ee Bo 


53(8 + Bo) + 78°C(& + Bi + aS?) = 0 


5° + Bod + j8°C%(1 + a*)6 + 788°C? = 0 
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(2) + (2) (2) ca econ 4) ai Ae a 
Bo i, aa ee T\B, 7 


Write 
6 —_ ° 
Bo 
 ) a 
(<) ( , = dad (3 \ 
B\* . 
(2) Cia’ = db 
0 
Then 


26+ 33+ j(a + 5)? + ja = 0 (4A 


a is assumed to be positive, and 6 is assumed to be real and non-negativ: 
For 6 = 0 we have 
(zs? + ja) (22 + 1) = 0 (5A 


. © = 1/8 + 1/3 293 1/3 4977/8 
595-9, 98 4 98: Ce 


Af ya. £ (6A 
We have 


4 


: As -* 
[Sct + 32 + 2j(a + b)s] = + jz =0 
00 


Oz —4jS £ 

ab 524 + 322 + 2j(a + b)s on 
From this we draw the following conclusion. Suppose that for a certain 
value of 6 the five roots are distinct, and that among them there is a purely 
imaginary root. Then as 6 varies, in the neighborhood of its initial value, 
that root remains purely imaginary. 

In particular, consider 6 as increasing from the initial value 0. As long 

as the five roots remain distinct, there are exactly three purely imaginary 
roots. 


In order to have a real root s = x, we would have to have simultaneously 
a+ x2 = 0 
(a+ bdx?+a=0 (8A) 


This is impossible (with @ > 0). Hence there is never a real root. 
In particular, as } increases from 0, no root can cross the real axis. Hence, 
as 6 increases from 0, as long as the roots remain distinct, there are two 


purely imaginary roots above the real axis, one purely imaginary root below 
the real axis, and two complex roots below the real axis. 
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Since there is no term in 3* in the equation, the sum of all the roots is 0 
Hence the two complex roots must be located symmetrically with respect to 
he imaginary axis. 

First order variations of the roots with 6 can be calculated at once by 
means of the equation 


Oz 75 


ab oz! + 32* + 2y(a + d)z 


OA 


In principle, higher-order variations can be calculated by carrying the 
differentiation to higher orders. However, the formulae get wonderfully 
complicated. 

A very practical way of solving the equation is the following: 

The three imaginary roots can be found by plotting a curve. If we let 
cs = sy, (4A) becomes 


y— y+ (a+ by —a=0 (10A) 


For the imaginary roots y is real and we have merely to plot the left-hand 
side of (10A) vs. y to find the roots. Denote them by 2: , 22 , 33 , which are 


now regarded as known numbers. These roots satisfy the equation 


(g — 2)(3 — 22)(3 — 33) 33 —(g, + so + 33)2? + (2122 + 2133 + Zet3)2 


A A 33 39 33)3 312 3133 Ses 
— 549 23 + ays? + avz + as 0 11A) 
The two complex roots satisfy some equation 
+ Os + Be = O (12A) 


The @’s are at present unknown. When we find them we can at once calcu- 


late the complex roots. We must have 
3 


> 


+ ays? + aos + az) (2? + Bis + Bo) = 3? + 3? + 7(a+ b)s? + 7a 13A) 


(3 


Comparing the coefficients of st and z , we get the equations 


33 = ja (14A) 


which give us the @’s. 

Suppose that the magnetic field is very small, so that 8) <g. Then 
unless @ is very small, both a and & in (10A) will be very large numbers, and 
we find that two of the imaginary roots are given approximately by 


1 
a 
y= (4, 
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As 8) —+ 0 the other three roots are given by 
l . 1/3 
5 = |-j(a + 3D)| (164 


rhese three roots correspond to the waves found for traveling-wave tub 
with a purely longitudinal field. The roots according to (15A) represe 
such a combination of deflection’ and bunching as to produce no induc« 
current in the circuit. The roots of (15A) are “extra” roots attributable 
the consideration of transverse fields and transverse electron motion. 


For the roots given by (15A), 6/8 — 0 as By) 0. Thus in this case it js 


convenient to form the solution of two parts, one varying as 


ifs ss ( Apo ) 
é sin 3 
a+b 
jB2 ( apo ) 
e COS 3 
ad + b 


As 3) — 0, the first of these approaches the form 


and the other varying as 


Bz 


and the second approaches the form 


Again, these ‘‘extra’’ waves produce no induced current in the circuit. 
Two additional pieces of information: 
As a — 0, 6 a remaining fixed, the roots approach the limiting values 


0, 0,0, 7, —7. 
Asa» %, ba remaining fixed, two of the roots approach the limiting values 


a 


LIN +8 


the other roots behave as 


9 


3445/3 
é 


qa -+- b)! ila +. b)! sais *, Jha +- b)' 


Much of the preceding discussion depends upon the roots remaining distinct. 
The condition that two or more of the roots coincide, which is a relation 
between a and 8, can be written out, but it has not as yet been reduced to a 
compact and intelligible form. 
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Abstracts of Technical Articles by Bell System Authors 


Pulse Code Modulation.' H. S. Biack and J. O. Epson. A radically 
new modulation technique for multichannel telephony has been developed 
which involves the conversion of speech waves into coded pulses. An 
8-channel system embodying these principles was produced. The method 
uppears to have exceptional possibilities from the standpoint of freedom 
from interference, but its full significance in connection with future radio 
and wire transmission may take some time to reveal. 

Modulation in Communication” F. A. Cowan. Any signaling system 
requires some means for introducing a change in conditions at the sending 
end which may be recognized at the receiving end. The process by whic! 
the conditions are changed has come to be called modulation. There are 
many varieties of forms of change as well as a large number of conditions 
which are subject to change in response to the signals to be transmitted. 

In early systems for communication at a distance the signal information 
might have been impressed upon a rising column of smoke, a light, a stone 
tablet, or a sheet of parchment. For modern communication wide use is 
made of systems in which the signals change the magnitude or condition 
of electric energy. 

Starting with the electric telegraph a little more than a hundred years ago 
this medium of communication has grown steadily more important and more 
complex. To meet a variety of needs many systems of modulation have 
been developed and papers describing them in detail are available in the 
technical literature. Various aspects of the modulation processes have 
been analyzed carefully and presented and some of the earlier conceptions 
have acquired a classical textbook status. Recent trends have placed 
emphasis on modulation systems which more readily may be understood 
when viewed in a somewhat different manner. It is the purpose of this 
paper to present certain conceptions which may facilitate the understanding 
of the various systems of modulation and permit an improved perspective. 

Frequency Division Techniques for a Coaxial Cable Network’ R. E. 
CraM.¥}. T:“Drxon and G. H. Huser. A description is given of develop 
ments employing frequency division techniques by which. the telephone- 
message-carrying potentialities of the coaxial cable system are realized. 
By these methods 480 high quality telephone messages are prepared for 


' Trans. A. 1. E. E,vol. 66, 1947 (pp. 895-899). 
2 Trans. A. I. E. E., vol. 66, 1947 (pp. 792-796). 
3 Trans. A. I. E. E., vol. 66, 1947 (pp. 1451-1459). 
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transmission over the line and restored to original condition at main termi! 
points. At intermediate points appropriate groups of channels may | 
removed, inserted, bridged, or relocated in the frequency spectrum of t! 
line. 

Experimental Studies of a Remodulating Repeater... W. M. Goopai 
his paper describes tests made on an experimental broad-band microway: 
f.m. repeater. A superheterodyne receiving unit is used with a microway: 
reflex-oscillator transmitting unit to form a repeater. An experimenta 


setup for testing this repeater in a circulating-pulse system is described 


Oscillograms showing the performance of the repeater on a multilink basis 


are discussed. 

An Electronic Regenerative Repeater for Teletypewriter Signals® R. B 
HrARN. The important factor in teletypewriter signal transmission over 
circuits is the relative position on a time scale of the code pulses. If this 
timing is preserved, wide amplitude variations can be experienced without 
errors resulting. Correctly timed signal pulses at the transmitting end 
of a circuit are not necessarily properly timed at the receiving end, as the 
transmission path may shift the timing of some transitions with respect to 
others. However, if the signals are not too badly changed or distorted, it is 
possible to retime them at an intermediate point and send them on in their 
original form. 

Many arrangements have been devised for automatically retiming and 
retransmitting teletypewriter signals. These arrangements are known as 
regenerative repeaters. A few of these have been designed to make use of 
electronic timing arrangements and the purpose of this paper is to describe 
such an electronic regenerative repeater, known as repeater TG-29, designed 
originally for use by the Armed Forces. 

Submarine Detection by Sonar.® A.C. KELLER. Sonar, the only effective 
method of detecting completely submerged submarines was a major factor 
in the defeat of the U-boat and the winning of the Battle of the Atlantic. 
A majority of the 996 enemy submarines sunk during World War II was 
detected and located by sonar. The word sonar is formed from the phrase 
SOund Navigation And Ranging and applies broadly to under water sound 
devices for listening, echo ranging, and locating obstacles. 

The QJA sonar system, one of those which got into active service during 
World War II, is described here. This equipment was designed by Bell 
lelephone Laboratories and manufactured by the Western Electric Com- 
pany. 


‘Proc. 1. R. E., May 1948 (pp. 580-583). 
5 Trans. 2 
® Trans. / 


E., vol. 66, 1947 (pp. 904-911). 


es 
1.7. E. E., vol. 66, 1947 (pp. 1217-1230). 
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Measurement of High Q Cavities al 10,000 Megacycles.. RK. W. LANG! 
Known methods of measuring Q in high Q resonant cavities, together wit! 


their accuracies and sources of error, are discussed. For relatively low 
values of Q and of frequency, it is shown that band width methods are more 

accurate than decrement methods. For values of 0 above 30,000 at fre 

quencies above 3,000 megacycles the reverse is true. The signilicant feature 
of the present method, the wide range heterodyne decrement method, is that 
the accuracy is improved by observing the decay over a relatively long 
interval of time. An absolute accuracy of plus or minus three per cent and 
a relative accuracy of plus or minus two per cent are achieved. Design 
features and performance are discussed and constructional details are 
presented. 

Absorbing Media for Underwater Sound Measuring Tanks and Baffles.* 
W. P. Mason and F. H. Hipsarp. By using absorbing walls surrounding 
a small body of water, measuring tanks have been produced which will de- 
termine the directional properties of underwater sound instruments down 
to a level of 25 db below the direct beam. These absorbing media are con- 
structed by inserting fine mesh screen or packed copper wadding in a viscous 
liquid such as castor oil. These obstructions result in an enhanced viscous 
action which is nearly independent of the frequency above 10 kilocycles. A 
six-inch wall can reduce the reflections by 20 db. Tanks using such ab- 
sorbing media were used for testing transducers at the manufacturing plant 
and were used for determining the approximate characteristics of small sized 
instruments. Absorbing media were also used in the sound transparent 
dome housing the transducer and in the back of the QJB transducers. 

Calculation of the Directivity Index for Various Types of Radiators.’ C. 
T. Motioy. This paper gives the derivations of the ‘‘directivity index” 
formulas for several types of sound radiators. The ‘directivity index”’ is 
defined as “‘the ratio of the total acoustic power output of the radiator to 
the acoustic power output of a point source producing the same pressure at 
the same point on the axis.” The utility of the directivity index concept is 
that it permits power calculations to be made for all radiators in the same 
manner as for point sources. Directivity index formulas, together with 
graphs covering practical cases, are given for the following types of radiators: 

1. General plane piston in infinite baffle, 

2. Circular plane piston in infinite baffle, 

3. Rectangular plane piston in infinite baffle, 

4. Sectoral horn, 

7 Trans. A. I. E. E., vol. 65, 1947 (pp. 161-166). 


8 Jour. Acous. Soc. Americu, July 1948 (pp.476-483) 
> Jour. Acous. Soc. America, July 1948 (pp. 387-405) 
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Ss. Multicellular horn, 


6. Piston set in sphere 

1 Magnetic Field Strength Meier Employing the Hall Effect in Germanium 
Gs. lL, Prarson. The instrument to be described measures magnetic fiel 
strengths as determined from the Hall effect in germanium. The essentia 
parts of this instrument include a small germanium probe, and a panel typ 
microammeter calibrated directly in gauss. Its accuracy is +2 percent at 
lields between 100 and 8000 gauss. At higher fields the readings are to: 
low, the error amounting to 9 percent at 20,000 gauss. The chief advantages 
of this instrument are: (a) small size and portability, (b) continuous reading 
rather than ballistic as in ordinary field strength meters, and (c) a small 
honmagnetic probe with which one can search in very narrow gaps. 

The Representation of Vowels and their Movements." Ravpu K. Porter 
and GorpoN E. Peterson. It is shown that movement of the major 
resonances in the voiced sounds of speech may be represented by traces in 
a three-dimensional graph. Apparently a great deal can be learned about 
speech through investigation of such traces, and they suggest a new method 
for vowel designation that is particularly adaptable to quantitative analysis 

General Mobile Telephone System.'* H. 1. RoMNes and R. R. O’COoNNok. 
Phe tremendous need for communication with ships, airplanes, trucks, tanks, 
and other mobile units used in such large quantities during the war accel 
erated the development of practical mobile radiotelephone equipment for 
use in the 30 to 200 megacycle range and emphasized the practicability and 
usefulness of mobile telephone service. By the end of the war the art had 
advanced sufficiently in the applications of these higher frequencies so that 
it seemed practicable to provide telephone service to mobile units on a gen 
eral basis rather than limit it to safety and emergency services as had been 
the case before the war. The Federal Communications Commission there- 
fore made available a few frequencies for experiments in this field. In the 
two years which have elapsed, the Bell System has made this service avail 
able on an experimental basis in more than 60 cities and about 100 more 
systems are under construction. This paper describes the arrangements 
used and outlines the experience obtained to date with this service. Im- 
provements are being made constantly so that this must be regarded as an 
interim report on a rapidly changing and expanding service. 

Interference between Very-High-Frequency Radio Communication Circuils.”” 
W. Rar Youn, Jr. Interference between different radio circuits is an old 
problem, one which in the past generally has been solved by trial and error 


0 Rev. Sci. Instruments, April 1948 (pp. 263-265) 

Jour. Acous. Soc. America, July 1948 (pp. 528-535) 

“Trans. A. 1. E. E., vol. 66, 1947 (1658-1666). 

‘8 Proc. 1. R. E., Waves and Electrons Section, July 1948 (pp. 923-930). 


— 








ind | 
the u 
feren 
the v 

Th 
powe 
of su 
Situe 

In 
enou 
the : 
spur 
ina 

Tl 
ferel 
relat 
whi 
is ce 
whi 
opel 

























if 





{BSTRACTS 751 


ind by hand tailoring (special filters, etc.). With the general increase in 
the usage of radio communication, however, the amount of potential inter 
ference is greatly increased. This paper will be concerned principally wit! 
the v.h.f. problem. 

There is generally a large difference between transmitting and receiving 
power levels. As a result, spurious radiations, spurious responses, and lack 
of sufficient receiver selectivity may in many instances cause interference 
Situations are described in which such interferences are likely. 

In mobile systems interference can occur if the interfering station is close 
enough to “capture” it from the desired signal. This, in turn, depends upon 
the selectivity and spurious response of the receiver and the amount of 
spurious radiation from the transmitter. The problem can be approached 
in a statistical manner. 

The types of spurious radio behavior which are common causes of inter 
ference are discussed. Sample measurements are given to illustrate the 
relative magnitude of the various modes of behavior. Formulas are given 
which permit computation of the frequency of the disturbances. A method 
is described for making charts suitable for a given type of equipment from 
which the spurious frequencies can be read directly as a function of the 


operating frequency. 
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100 MILLIMETERS 


INSTRUCTIONS Resolution is expressed in terms of the lines per millimeter recorded by a particular 
film under specified conditions. Numerals in chart indicate the number of lines per millimeter in adjacent 
“T-shaped” groupings. 

In microfilming, it is necessary to determine the reduction ratio and multiply the number of lines in the 
chart by this value to find the number of lines recorded by the film. As an aid in determining the reduction 
ratio, the line above is 100 millimeters in length. Measuring this line in the film image and dividing the length 
into 100 gives the reduction ratio. Example: the line is 20 mm. long in the film image, and 100/20 = 5. 


Examine “T-shaped” line groupings in the film with microscope, and note the number adjacent to finest 
lines recorded sharply and distinctly. Multiply this number by the reduction factor to obtain resolving power 
in lines per millimeter. Example: 7.9 group of lines is clearly recorded while lines in the 10.0 group are 
not distinctly separated. Reduction ratio is 5, and 7.9 x § 39.5 lines per millimeter recorded satisfacto- 
rily. 10.0 x § = $0 lines per millimeter which are not recorded satisfactorily. Under the particular condi- 
tions, maximum resolution is between 39.5 and 50 lines per millimeter. 


Resolution, as measured on the film, is a test of the entire photographic system, including lens, exposure, 
processing, and other factors. These rarely utilize maximum resolution of the film. Vibrations during 
exposure, lack of critical focus, and exposures yielding very dense negatives are to be avoided. 
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